


Institutional Archive of the Naval Postgraduate School 


Calhoun: The NPS Institutional Archive 
DSpace Repository 


Theses and Dissertations 1. Thesis and Dissertation Collection, all items 


1970 


Application of a data structuring concept in a
general-purpose fact-retrieval system. 


Petrucci, Richard Joseph 


Monterey, California. Naval Postgraduate School 


http://ndl.handle.net/10945/14887 


Downloaded from NPS Archive: Calhoun 








APPLICATION OF A DATA STRUCTURING CONCEPT 
IN A GENERAL-PURPOSE FACT-RETRIEVAL SYSTEM 


by 


Richard Joseph Petrucci 














United States 
Naval Postgraduate School 







September 1970 


This document has been approved for public release and sale;
its distribution is unlimited.




Application of a Data Structuring Concept 


in a General-Purpose Fact-Retrieval System 


by 


Richard Joseph Petrucci
Lieutenant Commander, United States Navy 
B.S., United States Naval Academy, 1961 


Submitted in partial fulfillment of the 
requirements for the degree of 


MASTER OF SCIENCE IN COMPUTER SCIENCE 


from the 


NAVAL POSTGRADUATE SCHOOL 
September 1970 





ABSTRACT 


An on-line, general-purpose, fact-retrieval system is presented
which employs a classificatory data structuring technique. The
technique embraces the basic concept of hierarchical classification
of data and provides users with multiple avenues of access to a data
file. Additionally, the data file may be partitioned into unrelated
data sets.








TABLE OF CONTENTS

I. INTRODUCTION
II. INDEXING
    A. UNIT-TERM INDEXING
    B. KEY-WORD-IN-CONTEXT INDEXING
    C. THESAURUS
    D. HIERARCHICAL CLASSIFICATION
    E. FACETED INDEXING
    F. AUTOMATIC INDEXING
III. STORAGE
    A. FILE ORGANIZATION
        1. Sequential Organization
        2. Chaining
        3. Branching
        4. List Structuring
    B. FILE SEQUENCING
IV. RETRIEVAL
    A. FULL-FILE SEARCH
    B. SEQUENTIAL SEARCH
    [remaining entries of this section are illegible in the source]


V. DATA STRUCTURE FOR A FACT-RETRIEVAL SYSTEM
    A. DATA STRUCTURE
        1. Class Structure Representation
        2. Data Representation
        3. System Utility
    B. [heading illegible in the source]
        1. Query Format
        2. Boolean Expressions
        3. Alphabetic and Numeric Ranges
VI. SYSTEM STRUCTURE
    A. DATA FILES
    B. TREE-TYPE DATA STRUCTURES
        1. Data Cells
        2. Structuring Process
    C. INDEX FILES
        1. Characteristics of the Master Index
        2. Constructing the Master Index
        3. Data Record Table
    D. INFORMATION FILE
    E. RETRIEVAL PROCESSOR
        1. [heading illegible in the source]
            a. Determining Data Base Partitions
            b. Determining Format and Class Definitions
        2. [illegible] Retrieval
        3. [illegible] Retrieval
    F. [illegible] DATA BASE
        1. Changes and Deletions
        2. Additions
APPENDIX A  A LISTING OF SAMPLE RUNS MADE WITH THE
            GENERAL-PURPOSE FACT-RETRIEVAL [illegible]
[illegible] PROGRAM
BIBLIOGRAPHY
INITIAL DISTRIBUTION LIST
FORM DD 1473











LIST OF FIGURES

BASIC FLOW DIAGRAM OF THE INFORMATION RETRIEVAL PROCESS

CONVERSION OF VARIABLE-LENGTH RECORDS TO FIXED-LENGTH RECORDS
USING THE BRANCHING TECHNIQUE

ADDITION OF RECORDS TO AN EXISTING BRANCHING STRUCTURE

PARENTHESIZED CLASS EXPRESSIONS AND [illegible] TREE STRUCTURE
FOR THE HIERARCHICAL CLASSIFICATION OF DATA

[illegible] COMPOSITION

[illegible] COMPOSED OF DATA CELLS

TREE STRUCTURE FOR THE FORMAT: [illegible]

REPRESENTATION OF THE MASTER INDEX

RESERVED RECORD IN THE MASTER INDEX FOR FORMAT NAMES WITH
ASSOCIATED DATA CELLS AND FORMAT NAMES IN SEQUENTIAL [illegible]

REPRESENTATION OF THE DATA RECORD [illegible]

RELATIONSHIP BETWEEN THE MASTER INDEX, DATA RECORD TABLE,
AND SEQUENTIAL [illegible]











I. INTRODUCTION 


The term "information retrieval" and the initials "IR" were
coined by the editors of Fortune about ten years ago. However,
Vannevar Bush first formally declared the necessity for an
information retrieval discipline in his "As We May Think" article,
which was published in the Atlantic Monthly in 1945. The United
States Government and those people involved in Library Science were
truly the first innovators of this discipline in the mid-fifties. The
technological explosion being felt at that time prompted government
agencies and library scientists to search for more efficient systems
for indexing, storing, and retrieving documents. The primary concern
was the assurance that vital technical information would be available
to all possible users. The discipline of information retrieval as we
know it today emerged as a result of this technological explosion.

Information retrieval has been defined in numerous ways. However,
all definitions share a common point which is best stated by
Taube [Ref. 1] as: "The right information made available to the right
person at the right time." Bourne [Ref. 2] states that "Information
retrieval has become a generic term, firmly established through
common usage, which includes reference, fact, and document
retrieval." Bourne also differentiates between data processing and
information retrieval. The former includes the manipulation,
replacement, alteration, or addition to the data on file, while the
latter is concerned with the storage of data in unaltered form for
later re-use. Use of the term "information retrieval" in this paper
implies the generic meaning stated by Bourne.

This paper is devoted to the investigation of a data structuring
concept proposed by Kildall [Ref. 3] for use in a general-purpose
fact-retrieval system. Before investigating Kildall's proposal in
section VI, the techniques of indexing, storage, and retrieval
established for Library Science purposes will be reviewed. These
basic techniques form a foundation for the design of specific IR
systems.

Information retrieval is divided into three major operations:

1. Indexing (classification, description, and structuring of
information sources).

2. Storage (organization and storage of files).

3. Retrieval (searching and displaying information).

Figure 1 is a simplified diagram which illustrates a typical
information retrieval process. An index is constructed which
describes the information source (document or record) and is stored in
a file along with the source itself. A request for information (query)
is directed to the index file where the location of the requested
document within the information file is found. A search of the
information file then results in the retrieval of the document. This
process is analogous to the indexing and storing of new books received
in a library, and the search for information by a library patron.
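The flow just described can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the two dictionaries stand in for the index file and the information file, and the names `store` and `retrieve`, the location numbers, and the sample sources are hypothetical.

```python
# Hypothetical in-memory stand-ins for the two files of Figure 1.
index_file = {}        # index term -> set of locations in the information file
information_file = {}  # location -> stored information source


def store(location, source, terms):
    """Indexing and storage: file the source and post its location under each term."""
    information_file[location] = source
    for term in terms:
        index_file.setdefault(term.lower(), set()).add(location)


def retrieve(query_term):
    """Retrieval: find locations in the index file, then fetch the sources."""
    locations = sorted(index_file.get(query_term.lower(), set()))
    return [information_file[location] for location in locations]


store(101, "Principles of Automated Information Retrieval", ["information", "retrieval"])
store(102, "Treatment of Data", ["data"])
print(retrieve("retrieval"))
```

The query never touches the information file directly; it is answered by way of the locations held in the index file, as in the library analogy.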
















Figure 1

Basic Flow Diagram of the Information Retrieval Process

[Diagram not reproduced. Its labels are: INFORMATION SOURCE, INDEXING
PROCESS, Index Record, INDEX FILE, INFORMATION FILE, Storage,
Information, USER.]





II. INDEXING


Indexing is the classification, description, and structuring of
information in such a manner that retrieval of the information is
accomplished expeditiously. This task is performed on information
sources such as books, documents, and files and is an integral part
of the information retrieval process. Since retrieval is the
counterpart of indexing, the indexing and retrieval schemes used in an
IR system must be compatible in order for a user to communicate with
the system. Clearly, retrieval efficiency (i.e., ease and speed of
retrieving desired information with a minimum of false drops¹) is
related to the efficiency and consistency of the indexing process.

As a rule, the information base of an IR system is specialized
and as such requires a professional jargon. Ideally, the indexer and
system user are experts in this professional language. However, this
is not necessarily the case, which creates a problem commonly
confronted by IR system designers: how to structure specialized data
for input to the system in a manner that is convenient to both the
indexer and the user while maintaining data accessibility. An example
of an indexing language is the Dewey Decimal System used for indexing
library books.


Selection of an indexing language is based upon the following
considerations:

1. The language should be convenient to use, such as natural
language or a language that could be easily learned.

2. Computerized systems require that the language be rigid
enough to be usable in the machine, yet it must also remain
convenient for human use.

3. The vocabulary should be broad enough to allow accurate
description of the information.

4. The language should be flexible enough to allow modification
as changes in information occur.

¹ Output of irrelevant information as a result of a retrieval
request is called a "false drop."


There are numerous indexing languages in use today, each
tailored to suit the specific usage of the IR system. Therefore,
indexing languages normally reflect the viewpoint of the system
designer in his attempt to organize the system's data base to best
suit the needs of the user. Several indexing techniques which evolved
from Library Science will be reviewed in the sections that follow.
These techniques appear to form the nucleus from which specialized
systems are formed. Although the techniques are primarily oriented
toward document indexing, variations are used in all types of IR
systems. The techniques are presented in ascending order of:

1. Effort on the part of the indexer.

2. Difficulty in automating.

3. Indexing power.

4. Retrieval efficiency.


A. UNIT-TERM INDEXING


The simplest indexing technique involves the extraction of
descriptive words from the information source. The source is then
associated with each of the terms used to describe its content. In
the case of a library book, or other document, descriptive words
may be taken from the title, abstract, or the text itself. This
technique requires a minimum of effort (other than reading the source)
on the part of the indexer. In addition, the indexing is accomplished
rather quickly since the indexer need not be intimately familiar with
the subject material. Unit-term indexing is particularly advantageous
when no information is available on the spread of subject material.
The addition of new material to the data base is easily accomplished
by expanding the vocabulary (unit-terms) to include new descriptive
words. However, unit-term indexing lacks rules for combining terms
into units which have meaning. This shortcoming causes indexing
problems when synonyms, plural word forms, and generically related
terms are encountered in the source document.

The search device used in such a system is an alphabetical
listing (indexing record) of the key words used by the indexer. In
general, the information source is listed with each key word and
is used as a source descriptor, or the listing may indicate the location
of the source, or both. It is possible that the user will have difficulty
in using this system unless he knows precisely the topic that he is
searching for. An analogy may be drawn to searching the telephone
book for a name when the spelling of the name is not known.
Therefore, this indexing scheme is often utilized in IR systems where
the user is familiar with the professional jargon contained in the
information sources (e.g., technical libraries).

An excellent example of this subject-indexing² technique is the
Uniterm Coordinate Indexing System, which dates back to 1952. The
Uniterm ("unit-term") System includes fifteen rules governing the
indexer's operation, rules for determining key words, methods for
processing word meanings, and cross-referencing techniques. Some
agencies using this system have drafted standard unit-terms (key
words) to be used by indexers. However, this is unnecessary for an
unstructured language since new unit-terms may be added without
perturbing the existing system. An example of an index that might
be constructed from a Uniterm System is shown below. The numbers
below the unit-terms might represent reference serial numbers, or
library call numbers.

ABLATION 
ADGmoOto™ Lic 
ADSORPTION 

137 459 823 1201 
ADHESIVE 

aot 
AERODYNAMIC 


139 241 242 357 552 1010 1168

² "Subject indexing," "keyword indexing," and "coordinate
indexing" are terms commonly used to describe the technique presented
here.
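A unit-term index of the kind shown above can be sketched as a mapping from each unit-term to the serial numbers posted under it. The sketch below is illustrative only: the serial numbers and their assignment to terms are invented, and the function names `build_uniterm_index` and `coordinate` are hypothetical. The `coordinate` function shows the coordinate search that gives the Uniterm system its name, namely finding the sources posted under every one of several terms.

```python
from collections import defaultdict


def build_uniterm_index(sources):
    """Post each source's serial number under every unit-term assigned to it."""
    index = defaultdict(list)
    for serial, terms in sorted(sources.items()):
        for term in terms:
            index[term.upper()].append(serial)
    return dict(sorted(index.items()))  # alphabetical listing of unit-terms


def coordinate(index, *terms):
    """Return the serial numbers posted under all of the given unit-terms."""
    postings = [set(index.get(term.upper(), [])) for term in terms]
    return sorted(set.intersection(*postings)) if postings else []


# Invented sample data: serial number -> unit-terms chosen by the indexer.
sources = {
    137: ["ablation", "aerodynamic"],
    241: ["aerodynamic"],
    459: ["ablation", "adhesive"],
}
index = build_uniterm_index(sources)
for term, serials in index.items():
    print(term, *serials)
print(coordinate(index, "ablation", "aerodynamic"))
```

Adding a new unit-term requires nothing more than a new key in the mapping, which reflects the remark above that an unstructured vocabulary can grow without perturbing the existing system.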





B. KEY-WORD-IN-CONTEXT INDEXING

Another very common subject indexing technique is called
"Key-Word-in-Context" (KWIC) indexing³. The indexing power of KWIC
is slightly greater than that of the simplest subject indexing
techniques since the key word is shown in the context of the entire
subject. There are several variations in KWIC format, but essentially
it is an alphabetical listing of key words. Whole phrases are
extracted from the source so that a user can easily determine the role
of the key word. The distinguishing feature of KWIC is its display
format, shown in the example below. Let us suppose that the title of
a source document is "Principles of Automated Information Retrieval."
Assuming that the indexer selects four key words to describe the
source, the KWIC index would appear as:

5135 Principles of AUTOMATED Information Retrieval
iples of Automated INFORMATION Retrieval 5135 Princ
ion Retrieval 5135 PRINCIPLES of Automated Informat
omated Information RETRIEVAL 5135 Principles of Aut

Note that "automated," "information," "principles," and "retrieval"
are individual key words. A user desiring this source document
could find it by using any one of the four key words. Note also that a
user may find this system easier to use than the Uniterm System if he
is unfamiliar with the subject material.

³ Also referred to as "permuted" or "permuted title" indexing.
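The KWIC display format can be reproduced with a simple rotation. The sketch below assumes that each entry is the serial number and title treated as a ring of characters, rotated so that the key word begins at a fixed column and wrapped around at the line ends. The column position (19) is inferred from the example above rather than stated in the text, and the function name `kwic_lines` is hypothetical.

```python
def kwic_lines(serial, title, key_words, column=19):
    """Build one rotated index line per key word, key word capitalized."""
    ring = f"{serial} {title} "  # trailing blank separates the end from the start
    lines = []
    for word in sorted(key_words, key=str.lower):  # alphabetical by key word
        position = ring.index(word)
        start = (position - column) % len(ring)
        rotated = ring[start:] + ring[:start]  # key word now begins at `column`
        line = rotated[:column] + word.upper() + rotated[column + len(word):]
        lines.append(line.rstrip())
    return lines


for line in kwic_lines(
    5135,
    "Principles of Automated Information Retrieval",
    ["Principles", "Automated", "Information", "Retrieval"],
):
    print(line)
```

Because every key word lands in the same column, a reader can scan that column alphabetically and still see the surrounding words of the title on either side.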





C. THESAURUS
Indexing power may be increased further by determining generic
relationships between key words. The Armed Services Technical
Information Agency (ASTIA) and the Defense Documentation Center (DDC)
have produced thesauri, which are alphabetical lists of indexing terms
with related terms and "see" references. These lists are used by
indexers as a means of standardizing their operation. In other words,
indexers describe similar information sources in consistent fashion.
These thesauri define some hierarchy in key words and are useful to
the user as well as the indexer since they allow the user to formulate
queries with the exact terms used by the indexer. An example of a
thesaurus entry borrowed from Meadow [Ref. 4] is exhibited below.
COMPUTERS
(Computers and Data Systems)
    Includes:
        Calculating machines
    Generic to:
        ANALOG COMPUTERS
        ANALOG-DIGITAL COMPUTERS
        BOMBING COMPUTERS
        ...
    Also see:
        DATA PROCESSING SYSTEMS
        SIMULATION

Computing gun sights use GUN SIGHTS
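One way to picture how such a thesaurus standardizes both indexing and querying is sketched below. The data structure and the function names `preferred_term` and `expand` are assumptions made for illustration; only the terms themselves come from the entry above.

```python
# A thesaurus entry modeled on the example above (layout is hypothetical).
thesaurus = {
    "COMPUTERS": {
        "includes": ["Calculating machines"],
        "generic_to": [
            "ANALOG COMPUTERS",
            "ANALOG-DIGITAL COMPUTERS",
            "BOMBING COMPUTERS",
        ],
        "also_see": ["DATA PROCESSING SYSTEMS", "SIMULATION"],
    },
}

# "use" references map a non-preferred phrase to the standard indexing term.
use_references = {"COMPUTING GUN SIGHTS": "GUN SIGHTS"}


def preferred_term(term):
    """Replace a phrase by the standard term it refers to, if any."""
    term = term.upper()
    return use_references.get(term, term)


def expand(term):
    """Return the standard term followed by the more specific terms under it."""
    term = preferred_term(term)
    entry = thesaurus.get(term, {})
    return [term] + entry.get("generic_to", [])


print(preferred_term("Computing gun sights"))
print(expand("computers"))
```

An indexer and a user who both pass their terms through `preferred_term` arrive at the same vocabulary, which is the consistency the thesaurus is meant to provide.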


D. HIERARCHICAL CLASSIFICATION 

Probably the most widely used indexing technique is that of 
hierarchical classification where a universe of information is repeated- 
ly divided and sub-divided into a classificatory tree. This index 
language has a very tightly controlled but simple vocabulary contained 
in an authority list of key words provided with the classification system. 
Each key word in the authority list is assigned a numeric or alphanumeric 
code (mnemonic codes could be used but normally are not). As can be 
seen in the tree structure exhibited below, a key word contains all 
those key words generic to it (i.e., above it in the branch of the tree 
from which it was derived). Hierarchical schemes allow the indexer 
to describe an information source in generic levels so that the user 
may formulate his query in more general or more specific terms by 
moving up or down the classification tree. 

Modification of key word meaning is difficult to accomplish
since changing one word in the tree affects all key words generic to
it. However, changes at the bottom of the tree are easily made since
no perturbation of the tree occurs. Expansion of the vocabulary used
in this system is readily accomplished by expanding the tree
horizontally.

The most well known hierarchical systems are the Dewey
Decimal Classification System (exhibited below), the Library of
Congress System, and the Universal Decimal Classification System.


500     Pure Science
510     Mathematics
519     Probabilities and Statistical Mathematics
519.9   Treatment of Data
519.92  Programming (linear and dynamic)
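Moving up or down the classification tree can be sketched with a table that records, for each code, its name and the code immediately above it. The table layout and the function names `generic_chain` and `specific_terms` are hypothetical; the codes and names are those of the Dewey example above.

```python
# code -> (name, code of the class immediately above it, or None at the root)
classes = {
    "500": ("Pure Science", None),
    "510": ("Mathematics", "500"),
    "519": ("Probabilities and Statistical Mathematics", "510"),
    "519.9": ("Treatment of Data", "519"),
    "519.92": ("Programming (linear and dynamic)", "519.9"),
}


def generic_chain(code):
    """Moving up: list the classes from the root down to the given code."""
    chain = []
    while code is not None:
        name, parent = classes[code]
        chain.append((code, name))
        code = parent
    return chain[::-1]


def specific_terms(code):
    """Moving down: list every code that lies below the given code."""
    found = []
    for child, (_, parent) in classes.items():
        if parent == code:
            found.append(child)
            found.extend(specific_terms(child))
    return found


print([c for c, _ in generic_chain("519.92")])
print(specific_terms("519"))
```

A query stated at "519" can thus be broadened by walking the chain upward or narrowed by enumerating the codes beneath it.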


E. FACETED INDEXING
In the immediately preceding section a classification technique
was presented which structures a topic (universe of information) by
dividing and subdividing it to form a classificatory tree. Faceted
indexing deals with individual key words taken from the data source
and grouped into categories with respect to their usage within the
source. Terms within each group are structured into a classificatory
tree. A term extracted from the source is analyzed from several
points of view and a group of indexing terms is synthesized to
describe the key word in context. This technique is referred to as
"facet analysis," "faceted indexing," and "relational indexing," where
each point-of-view analysis of a key word is called a facet.

An excellent example of faceted indexing is given by Meadow
[Ref. 4]. Let us suppose that "steel" is a key word taken from a
source document. The document contains information relating to the
manufacture, use, chemical analysis, and properties of steel. By
appending descriptors to the key word "steel" the following index
terms are created:

STEEL, manufacture of 


STEEL, use in automobiles 


These index terms are not predefined in any authority list but
are constructed by the indexer by appending descriptors to the key
word. The terms follow some syntactic rule such as: subject, followed
by modifier, followed by operation modifier. The utility of this
technique is that the indexer, armed with a descriptor list and
syntactic rules tailored to suit the particular IR system, may analyze
a source from many points of view and construct index terms that
describe the information content in great detail.
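The synthesis of faceted index terms can be sketched as follows. The descriptor list and the function name `make_index_term` are hypothetical, and the syntactic rule is simplified here to "subject, descriptor, optional object"; an actual system would supply its own list and rules.

```python
# Hypothetical descriptor list from which the indexer may draw.
DESCRIPTORS = {"manufacture of", "use in", "chemical analysis of", "properties of"}


def make_index_term(subject, descriptor, obj=""):
    """Synthesize an index term: subject first, then descriptor and object."""
    if descriptor not in DESCRIPTORS:
        raise ValueError(f"descriptor not in descriptor list: {descriptor!r}")
    modifier = f"{descriptor} {obj}".strip()
    return f"{subject.upper()}, {modifier}"


print(make_index_term("steel", "manufacture of"))
print(make_index_term("steel", "use in", "automobiles"))
```

The index terms are built at indexing time rather than looked up, which matches the observation that they are not predefined in any authority list.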


F. AUTOMATIC INDEXING

In the foregoing discussions, it was assumed that the indexer
was human. A treatment of automatic (computer) indexing is now in
order.

Automatic indexing is difficult to accomplish for two main
reasons. First, the information source must be in machine-readable
form. In the case of books or other lengthy documents this is a very
expensive requirement. However, development of character recognition
devices and the production of transcripts in machine code as a
by-product of automatic typesetting have eased the cost of this
requirement. The second problem, and the more serious, is the
development of algorithms or heuristics which derive meaning from
strings of characters. This is an area of Artificial Intelligence in
which a good deal of research effort has been expended. However, the
results of this research have been empirical since we lack
sophisticated linguistic and semantic knowledge. References 5, 6, 7,
and 8 contain excellent treatments of the research conducted and
problems involved in machine translation of natural language, while
Ref. 9 contains a comparison of manual and automatic indexing
techniques.

There is an automatic indexing technique in commercial use
today; however, it is a "brute force" adaptation of KWIC. Basically,
the technique produces index key words by comparing words from the
source to words stored in an authority list. There are many
limitations to this system, such as the correct handling of hyphenated
words, plural forms, and proper nouns, but the primary limitation is
that the list must contain a sufficient number of appropriate words in
order for a source to be adequately indexed. The demands that such a
list places on the size, speed, and complexity of the system should be
obvious.
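The comparison against an authority list can be sketched in a few lines. The authority list below is invented for illustration and the function name `auto_index` is hypothetical. The second call shows the limitation mentioned above: word forms that are not in the list are simply missed.

```python
import re

# Hypothetical authority list of acceptable index key words.
AUTHORITY_LIST = {"automated", "information", "principles", "retrieval"}


def auto_index(text):
    """Return the words of the text that appear in the authority list."""
    key_words = []
    for word in re.findall(r"[a-z]+", text.lower()):
        if word in AUTHORITY_LIST and word not in key_words:
            key_words.append(word)
    return key_words


print(auto_index("Principles of Automated Information Retrieval"))
print(auto_index("Retrieving informations"))  # variant forms are not recognized
```

Every word of the source is compared against the whole list, so the cost of indexing grows with both the length of the source and the size of the list.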

Referring to Figure 1, it is seen that the indexing process produces
index records. The contents of the records vary widely and are
dependent upon the type of IR system (e.g., document, fact, or
reference). In addition to subject descriptors, the index may contain
the location of the information, source, author, reference to another
index record, or other information deemed pertinent by the system
designer. It will also be noted from the figure that the information
source, or information concerning the source, will also be stored in
the IR system. A large document such as a book probably will not be
stored in the computer; rather, a reference or abstract will be
stored as a substitute. In some cases, the index record itself will
contain all of the information associated with an information source.
For example, an index record for a library book may contain the
book's location within the library; the system will therefore present
the index record itself in answer to a user's query.





III. STORAGE


This section of the paper contains descriptions of various
techniques used for organizing index and information files within an
IR system's storage media. There will be no discussion of storage
devices since it is assumed that the reader is already familiar with
computer equipment. The reader is aware, of course, that the
system's capacity, cost, and response time are greatly affected by
the selection of various storage media.


A. FILE ORGANIZATION

Organization of an index file or information file specifies the
positioning of the records in relation to one another within the file
along with the physical position of the file within the storage media.
Choice of a rule which governs file organization is dependent upon
desired response time, peak retrieval loads, system reliability⁴,
category of users, cost, rate of information change, rate of system
growth, and type of storage media. There are several rules for file
organization which are extensively used in IR systems and they are
presented here. These rules are equally applicable to index and
information files.


1. Sequential Organization


The first method involves the sequential placement of
records within a file. The (i+1)th record follows (physically and/or
logically) the ith record. Examples are the alphabetical listing of
subject-indexing key words and the alphabetical arrangement of
employee records. This method is very conservative of memory space
since there is no need to supply pointers or links to indicate where
the next record in the file is located. On the other hand, additions or
deletions to the file are difficult to make. Let us suppose that we
desire to add a new name to the telephone book. Then all of the names
which follow the inserted name must be moved. Likewise, the deletion
of a name results in perturbation of the list. This type of organization
is most commonly used with magnetic tape where records are searched
sequentially.

⁴ Ability to retrieve a maximum of information with a minimum of
false drops.
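The cost of adding to a sequentially organized file can be illustrated with the telephone book example. In this sketch a Python list stands in for the file, and the function name `insert_sequential` and the sample names are hypothetical; the function reports how many existing records had to move to make room.

```python
import bisect


def insert_sequential(records, new_record):
    """Insert into an ordered file; return the number of records moved."""
    position = bisect.bisect_left(records, new_record)
    moved = len(records) - position  # every later record shifts by one place
    records.insert(position, new_record)
    return moved


telephone_book = ["Adams", "Doe", "Jones", "Smith"]
moved = insert_sequential(telephone_book, "Baker")
print(telephone_book, moved)
```

No links are stored, which is the saving in memory space noted above, but an insertion near the front of the file disturbs nearly every record.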
2. Chaining 

Another technique of file organization is called "chaining,"
where addresses (links, chains, or pointers) are stored in one or
more fields of a record to indicate the location of the next record
within the file. Recall from the discussion of indexing that thesauri
contain "see" references. These references are links which convey
the idea of chaining. Chaining is a particularly effective method when
used in a crowded memory since "referred to" records may be placed
in any available space within the memory (unlike the rigid sequential
scheme). Also, the utility of chaining is fully realized in a system
which experiences a high rate of information change. This method
requires more memory space than the sequential scheme since extra
fields must be appended to the records to accommodate the links.
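Chaining can be sketched by giving each record an extra field that holds the address of the next record. In the sketch below a dictionary keyed by address stands in for memory; the addresses, the names, and the functions `traverse` and `insert_after` are hypothetical.

```python
# address -> [name, address of the next record]; None marks the end of the chain
memory = {40: ["Adams", 12], 12: ["Doe", 73], 73: ["Smith", None]}
head = 40


def traverse(memory, head):
    """Follow the links from the head and list the records in logical order."""
    names = []
    address = head
    while address is not None:
        name, address = memory[address]
        names.append(name)
    return names


def insert_after(memory, previous_address, free_address, name):
    """Place a record in any free slot and re-link its predecessor to it."""
    memory[free_address] = [name, memory[previous_address][1]]
    memory[previous_address][1] = free_address


insert_after(memory, 40, 5, "Baker")
print(traverse(memory, head))
```

The new record went into whatever slot happened to be free, and only one existing link was changed, in contrast to the wholesale movement required by the sequential scheme.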







3. Branching
An extension of the chaining technique is referred to as a
"branching structure." Branching is used to achieve versatility in
changing record entries, changing file structures, and conversion,
where possible, of variable-length records to fixed-length records.
A trivial example of a branching file structure is shown in Figure 2.

Let us suppose that our file consists of all military flying clubs
in the United States. Each record consists of the club's name, address
(airport, city, state), membership, and type of aircraft. Obviously,
these records are variable-length because the number of aircraft
owned by each club is variable. The main file may be converted to
fixed-length records by replacing the aircraft type fields with a single
address. The aircraft types could then be included in another
fixed-length file. The address in the main record links to an address
file which in turn points to the file containing the aircraft types.
Repetition of aircraft type is eliminated from the main records, main
records are fixed-length, and changes are made only to the address
file, not to the main file or aircraft file.

Figure 3 exhibits another feature of this technique which replaces
all field entries in the main file (except the name) with addresses. If
it is later decided to add "county" to "city" and "state," then no
changes are required in the main file, but a field must be added to
each of the "city-state" file records to absorb the new addition.
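The flying club example can be sketched with a fixed-length main record that holds only a name and two addresses. Everything in the sketch is illustrative: the club names, the address values, and the names `MainRecord`, `aircraft_of`, and `add_county` are hypothetical, and the figures themselves are not reproduced exactly.

```python
from dataclasses import dataclass


@dataclass
class MainRecord:
    """Fixed-length main record: a name plus addresses into other files."""
    name: str
    city_state_address: int
    aircraft_address: int


city_state_file = {498: ["MONTEREY", "CA."], 513: ["NEEDLES", "CA."]}
address_file = {100: [210, 215], 101: [210]}  # address -> aircraft file addresses
aircraft_file = {210: "CESSNA", 215: "CHEROKEE"}
main_file = [MainRecord("CLUB A", 498, 100), MainRecord("CLUB B", 513, 101)]


def aircraft_of(record):
    """Follow the branch: main record -> address file -> aircraft file."""
    return [aircraft_file[a] for a in address_file[record.aircraft_address]]


def add_county(counties):
    """Add a county field to city/state records; the main file is untouched."""
    for address, county in counties.items():
        city_state_file[address].append(county)


add_county({498: "MONTEREY", 513: "XYZ"})
print(aircraft_of(main_file[0]), city_state_file[513])
```

Each aircraft type is stored once however many clubs own it, and the county was added without rewriting a single main record.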





Figure 2

Conversion of Variable-Length Records to Fixed-Length Records
Using the Branching Technique

[Diagram not reproduced. It shows a MAIN FILE with the fields NAME,
AIRPORT, ADDRESS, AIRCRAFT, and NEXT RECORD; an ADDRESS FILE; and an
AIRCRAFT FILE listing aircraft types such as CESSNA, CHEROKEE, and
PT-19.]





Figure 3

Addition of Records to an Existing Branching Structure

[Diagram not reproduced. It shows a MAIN FILE with the fields NAME,
AIRPORT, ADDRESS, AIRCRAFT, and NEXT RECORD, together with the
city/state file before and after the addition of a county field:]

CITY/STATE FILE BEFORE ADDITION OF COUNTY

498 | MONTEREY, CA.
513 | NEEDLES, CA.

CITY/STATE FILE AFTER ADDITION OF COUNTY

498 | MONTEREY, CA. | MONTEREY
513 | NEEDLES, CA.  | XYZ





4. List Structuring


Although chaining and branching allow records to be scattered
throughout memory, their membership in a particular file is
maintained by some order of relative placement (e.g., employee records
logically linked in alphabetical order but physically scattered
throughout the file). List structuring does not require that records be
ordered in any specific manner within a file. Further, the fields of a
record may be physically separated and then linked to form a logical
record. The advantage of this form of storage is the freedom of
changing field content structure, record content, and file structure.
However, this method requires a great deal more memory space than any
other technique. In addition, the retrieval process is relatively slow
since more time is required to gather the elements of a record together.

The three techniques of file organization described above are all
forms of list structuring and each demonstrates a different degree of
structural freedom. Chaining requires that fields remain contiguous,
but records, while remaining ordered, may be physically separated.
Branching is an extension of chaining allowing fields to contain address
linkages to other fields. The last method allows any ordering and
structuring of fields and records.


B. FILE SEQUENCING 
It is important that records be sequenced (sorted) in some
manner for use in IR systems. Sequencing is normally based on
some particular attribute of a record (called a sort key) such as the
"name" field of an employee record. Selection of the sort key is
based on many considerations, but the objective is to select the same
sort key as may be used in a retrieval request. Subordinate sort keys
may also be chosen when more than one record has the same primary
sort key value (e.g., several employees with the same last name).
Searching records which are ordered on the primary sort key is then
called an "ordered search."
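Sequencing on a primary sort key with a subordinate key can be sketched as below. The employee records and the field names `last` and `first` are invented for illustration.

```python
from operator import itemgetter

# Invented employee records; "last" is the primary sort key and
# "first" is the subordinate key used when last names are equal.
employees = [
    {"last": "Smith", "first": "Ann"},
    {"last": "Doe", "first": "John"},
    {"last": "Doe", "first": "Alice"},
]

ordered = sorted(employees, key=itemgetter("last", "first"))
for record in ordered:
    print(record["last"], record["first"])
```

The two records for Doe are adjacent after sequencing, and the subordinate key settles their relative order.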







IV. RETRIEVAL 


The retrieval process essentially consists of searching the index
files and information files for information which satisfies a user's
query. If the information is found, it is sent to the user; if not, the
user is so informed. It should be noted that "searching" and
"retrieval" are not synonymous. "Searching" is a file access operation
used to locate records for matching against the query, while
"retrieval" is the actual output of information which satisfies the
query. However, use of the word "retrieval" here will imply the entire
operation of searching and retrieval.

As previously discussed in section II, indexing and retrieval are
counterparts since indexing refers to the structure of information for
input to the files, while retrieval is the process of locating and
displaying desired information. Therefore, the query language employed
by the system user must be compatible with the index language
employed by the system designer. It is important that the query and
index languages use the same vocabulary in order for the IR system
to understand the user's requests. The user must also be familiar
with the system's logic in order to formulate an intelligent query. He
must know if the system honors the use of Boolean relationships
("and," "or," "not") and magnitude comparators ("greater than,"
"less than," etc.) as query terms.





Once the query is formulated it is input to the system's index 
file. A matching process takes place at the index file where the terms 
used in the query are matched against the index file records. Index 
records which match the terms of the query are employed as locators 
to direct the retrieval of data from the information file. 

The technique used in searching the index and information files 
is governed by the file organization (structure, sequencing, content, 
and storage medium). In the ensuing discussion of search techniques 
it should be borne in mind that whatever technique is used it is fixed 
within the IR system. Also, the interrelationship between search plan 


and file organization may limit file accessibility and search flexibility. 


A. FULL-FILE SEARCH 

One search plan incorporates a full-file search where every
record of the file is matched (e.g., the value of the query term is
matched against the value of the sort key). This plan is used when
the order of records within a file is unknown (e.g., a file of employee
records that are not alphabetically sorted). In this case, if we were
searching for Doe's record and found Smith's it does not follow that
we have searched too far since the records are not collated. In
addition, there may not be any assurance that a single match satisfies
the search (more than one Doe in the file). Therefore, all records
within a file must be searched.





B. SEQUENTIAL SEARCH

A sequential search plan might be used when the records are not 
only sequenced but sequenced on the same term as is used in the query. 
Sequential searches are normally used in conjunction with sequential 
access type storage devices. The records of a file are matched
sequentially until a successful match is made or until the value of the
sort key exceeds the value of the query term. In this case, searching
for Doe's record and locating Smith's record indicates that the search
has not only gone too far but no successful retrieval will be made
since there is no Doe in the file.
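
The sequential search plan can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the file is taken to be an in-memory list of (sort key, record) pairs already sequenced on the sort key, the name `sequential_search` is hypothetical, and collecting adjacent duplicates (rather than stopping at the first match) is a choice made here, not something the text prescribes.

```python
def sequential_search(records, query_term):
    """Scan a file sorted on its sort key; stop once the key passes the query term."""
    matches = []
    for sort_key, record in records:
        if sort_key == query_term:
            matches.append(record)      # keep scanning: duplicates are adjacent
        elif sort_key > query_term:
            break                       # e.g. reached SMITH while looking for DOE
    return matches


# Hypothetical employee file, sequenced on last name.
employees = [("ADAMS", 1), ("DOE", 2), ("DOE", 3), ("SMITH", 4)]
doe_records = sequential_search(employees, "DOE")
```

The early `break` is what distinguishes this plan from the full-file search: on an unsequenced file the same loop would have to run to the end.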


C. BINARY SEARCH

A binary search plan may also be used with a sequenced file.
The term "binary" implies that a two-valued decision is made after
every match attempt. The search begins in the middle of the file. If
the first match attempt is unsuccessful then the next attempt is made
one-quarter file length away from the first. The direction of the
subsequent search is dependent upon the result of comparing the value of
the query term and the sort key (e.g., if the sort key is greater than
the query term then move one-quarter file toward the beginning of the
file). Each successive move is then made one-half the length of the
preceding move. If there are n records in the file then there will be
approximately log₂ n moves to exhaust the file.
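
A minimal Python sketch of the binary search plan follows. It assumes the sort keys are held in a sorted in-memory list; the function name and the probe counter are illustrative additions used to make the halving behaviour visible.

```python
def binary_search(sort_keys, query_term):
    """Return (index, probes) for query_term in a sorted list, or (None, probes)."""
    low, high = 0, len(sort_keys) - 1
    probes = 0
    while low <= high:
        middle = (low + high) // 2
        probes += 1
        if sort_keys[middle] == query_term:
            return middle, probes
        if sort_keys[middle] > query_term:
            high = middle - 1    # sort key too large: move toward the beginning
        else:
            low = middle + 1     # sort key too small: move toward the end
    return None, probes


# Hypothetical sequenced file of seven last names.
keys = ["ADAMS", "BAKER", "DOE", "JONES", "KING", "LEE", "SMITH"]
position, probes = binary_search(keys, "DOE")
```

With seven keys no search needs more than three probes, in line with the approximate log₂ n bound stated above.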







D. DIRECT ACCESS SEARCHING

The last file searching technique relies upon a special type of 
index file called an inverted index. This is probably the most common 
type of index file used in IR systems. The inverted file records
consist of the descriptors produced during the indexing process. The
descriptors are used as sort keys for sequencing the records within
the index. Appended to each descriptor field are fields which contain
addresses of the associated records in the information file. Some
type of search plan is conducted (usually binary) for matching
descriptors (which are sort keys) to the query term. When a successful
match is achieved, the addresses of the appropriate information
records are obtained and the records are directly retrieved.
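
The inverted index idea can be illustrated with a short Python sketch. The record contents, the addresses 101 to 103, and the function names are all hypothetical; descriptors are simply taken to be the comma-separated terms of each record, which is an assumption made for brevity rather than the indexing process described in section II.

```python
from bisect import bisect_left

# Hypothetical information file: address -> record text.
information_file = {
    101: "DOE, JOHN, MONTEREY",
    102: "SMITH, MARY, MARINA",
    103: "DOE, JANE, MARINA",
}

def build_inverted_index(records):
    """Return a list of (descriptor, addresses) pairs sorted on the descriptor."""
    postings = {}
    for address, text in records.items():
        for descriptor in text.split(", "):
            postings.setdefault(descriptor, []).append(address)
    return sorted(postings.items())

def lookup(index, query_term):
    """Binary-search the descriptors, then fetch the records directly by address."""
    descriptors = [descriptor for descriptor, _ in index]
    position = bisect_left(descriptors, query_term)
    if position < len(index) and descriptors[position] == query_term:
        return [information_file[address] for address in index[position][1]]
    return []

index = build_inverted_index(information_file)
doe_records = lookup(index, "DOE")
```

Only the index is searched; the information file itself is touched solely through the addresses attached to the matching descriptor.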


E. COMBINED SEARCH PLANS 

The above treatment of search plans demonstrates that the 
techniques are dependent upon file organization but plans may be com- 
bined in one IR system. For example, a binary search may be em- 
ployed in the index file to locate the disk and/or track which contains 
the desired information while a sequential search is made of the track 


for the requested records. 







V. RETRIEVAL SYSTEMS


This section of the paper contains a discussion of the primary 
differences between reference, document, and fact retrieval in order 
to provide a frame of reference for the development of a fact-retrieval 
system. Reference retrieval is treated first since it is the least 
complicated of the three types of information retrieval. 

Queries used in a reference-retrieval system contain only the 
topic for which information is desired (e.g., STEEL). The material 
provided to the requestor is a list of references pertaining to his topic. 

Document retrieval queries are narrower in scope since
descriptive terms are used to modify the topic (e.g., STEEL, chemical
properties of). Documents are provided to the requestor which contain
the desired information.

Fact-retrieval systems are the most complicated and powerful 
of all since they are capable of providing specific answers to specific 


questions. 


A. REFERENCE RETRIEVAL

Reference retrieval is the first step taken by one in search of 
specific information. As explained above, a reference-retrieval 
system provides a user with a bibliography pertaining to the topic for 
which specific information is sought. The second step in the search 
for information is totally unrelated to the reference-retrieval system. 


The user must examine the documents listed in the bibliography in
order to obtain the desired information. It is clear that in the first
step the user's search for information is narrowed from a search of
the entire "library" to a "shelf" in the library.


B. DOCUMENT RETRIEVAL

The definition of document retrieval is not straightforward.
One point-of-view holds document retrieval as the second step of 
reference retrieval. In another point-of-view, it is a special case of 
fact retrieval. What this author regards as document retrieval may 
be fact retrieval to another. The definition upheld by this author is 
the retrieval of unprocessed text word-for-word as it is stored in the 
information file. An example would be requesting a specific report 


from a technical library. 


C. FACT RETRIEVAL

Fact retrieval ranges from the retrieval of processed text 
stored in an information file to the retrieval of specific answers to 
specific questions. The more powerful end of the spectrum is referred
to as "question answering." Reference 10 contains an excellent
treatment of the general characterizations, limitations, capabilities, 
and feasibility of the question-answering type of fact-retrieval systems. 
Reference 11 contains a practical example of a question-answering 
program. 

Confusion arises at the low end of the fact-retrieval spectrum
where it is difficult to distinguish the difference between document
and fact retrieval. One point should help clarify the difference.
Document-retrieval systems possess only rote memory which means
that their capability is limited to the display of information
word-for-word as it is stored in the data base. Fact-retrieval systems possess
the capability of manipulating data stored in the data base into a form
which best satisfies the user's request.


VI. DATA STRUCTURE FOR A FACT-RETRIEVAL SYSTEM


This section contains the description of a data structuring
technique proposed by Kildall [Ref. 3] for use in a general-purpose
fact-retrieval system. Specific usage of the system depends in part upon
the type of information stored in its files. However, the nature of the
system is the processing of data to provide a user with specific answers
to his queries. Therefore, the system approaches "question answering."
The data-structuring technique employs the basic concept of
hierarchical classification which divides a topic (also referred to as a universe
of discourse) into its class structure and correlates the data elements
of the information file to a tree-type classificatory structure.

A treatment of the retrieval process is also provided here since 
the query format is directly related to the data-structuring technique. 

This section is expressly devoted to a discussion of the data- 
structuring concept while section VII contains the description of the 
general-purpose fact-retrieval system which employs the proposed 
technique. The system was designed for the primary purpose of in- 
vestigating the potential of the data-structure concept and not for 
production purposes. 

As previously discussed, fact-retrieval systems range from the
manipulation of processed text to "question answering." The system
described herein maintains a position in the middle of this continuum.
The term "general purpose" used here does not necessarily mean that
the system may be utilized throughout the full range of fact retrieval.
Rather, it means that the system will accommodate files which contain
different types of information.


A. DATA STRUCTURE 

The structure employed for indexing data incorporates the
concept of hierarchical classification which allows the user to enter the
data base in a number of ways in order to extract desired information.
A universe of discourse is structured in terms of "classes" and a
hierarchy of classes is established onto which the associated data
elements are mapped. For example, assume that a universe of
discourse consists of personnel records. The records consist of names,
addresses, and telephone numbers which are members of the classes
"NAME," "ADDRESS," and "TELEPHONE NUMBER." "NAME" is
further divided into the subclasses "LAST," "FIRST," and "MIDDLE"
while "ADDRESS" contains "STREET," "CITY," and "STATE."
The data structure is then represented by a classificatory tree with
the data elements related to the classes contained in the tree. The
data element "DOE," for example, is identified as a member of the
class "LAST," and the class "LAST" is a member of "PERSONNEL
RECORD." All data elements of a structure are identified in this
fashion.


1. Class Structure Representation 


Class structures are represented by parenthesized
expressions which are used to define the structure of the classificatory tree.
The technique of employing parentheses to define structures is similar
to that technique employed in LISP S-expressions [Ref. 12].
Punctuation symbols used in the expressions are the left parenthesis, the
right parenthesis, and the comma. The parentheses are used to
enclose those classes which are directly related to a superclass while
the comma is used to separate the classes within the parenthesized
unit. Units within an expression are separated by commas and the
entire expression itself is enclosed by parentheses. As demonstrated
in the preceding section, "PERSONNEL RECORD" consists of the
classes: "NAME," "ADDRESS," and "TELEPHONE NUMBER." This
definition is called the format definition and is the foundation for the
construction of the classificatory tree. Format definitions are
represented by the parenthesized expression shown below.

PERSONNEL RECORD (NAME, ADDRESS, TELEPHONE NUMBER)

"NAME" and "ADDRESS" were further divided into subclasses
and the expressions below show the parenthesized forms for "class
definitions."

NAME (LAST, FIRST, MIDDLE)
ADDRESS (STREET, CITY, STATE)

Subclasses may also be subdivided and this process is replicated
to fully define the class structure of the universe of discourse. Figure
4 graphically demonstrates the class structuring process, the fully
parenthesized expression for the class structure, and the associated
classificatory tree. Although the above example does not include a





[Figure 4: rotated diagram, not recoverable from the scan. It shows the
format definition EMPLOYEE RECORD (NAME, ADDRESS, TELEPHONE NUMBER),
the fully parenthesized expression
((LAST, FIRST, MIDDLE), (STREET, CITY, STATE), TELEPHONE NUMBER),
and the associated classificatory tree.]

Figure 4

Parenthesized Class Expressions and Associated Tree
Structure for the Hierarchical Classification of Data.


subdivision for the class "STREET," one is shown in the tree structure
to demonstrate a third level of class replication.
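
The relationship between a format definition, its class definitions, and the fully parenthesized class expression can be sketched as follows. This is an illustrative Python fragment, not the system's implementation: the dictionary `class_definitions` and the function `expand` are hypothetical names, and a class with no definition of its own is assumed to be an end node of the tree.

```python
# Format and class definitions from the personnel record example.
class_definitions = {
    "PERSONNEL RECORD": ["NAME", "ADDRESS", "TELEPHONE NUMBER"],
    "NAME": ["LAST", "FIRST", "MIDDLE"],
    "ADDRESS": ["STREET", "CITY", "STATE"],
}

def expand(class_name):
    """Return the fully parenthesized expression for the subclasses of class_name."""
    parts = []
    for subclass in class_definitions[class_name]:
        if subclass in class_definitions:
            parts.append(expand(subclass))   # subclass is itself subdivided
        else:
            parts.append(subclass)           # end node of the classificatory tree
    return "(" + ", ".join(parts) + ")"

expression = expand("PERSONNEL RECORD")
```

Adding a definition for "STREET" to the dictionary would produce a third level of nesting without any change to `expand`.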


2. Data Representation 


Once the class structure is defined, the associated data may 
be mapped directly onto the structure. Data representation is identical 


to the class expression as shown below. 


((DOE, JOHN, JAMES), (203 ELM STREET, MONTEREY, CA.), 384-9363)
                              |
((LAST, FIRST, MIDDLE), (STREET, CITY, STATE), (TELEPHONE NO.))
                              |
               NAME, ADDRESS, TELEPHONE NO.
                       EMPLOYEE RECORD


Repeated data elements within a record are
easily represented by properly parenthesizing the record. For example,
two phone numbers for John Doe would be represented by:


((DOE, JOHN, JAMES), (203 ELM STREET, MONTEREY, CAL. ), 
(384-9363 , 384-6214)) 


The class membership of each data element in the record is 


clearly defined by the parenthesized expression. 
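
How class membership follows from position in the parenthesized expression can be shown with a small Python sketch. The parser and the `classify` helper below are hypothetical illustrations; they assume well-formed input in which a data record mirrors the class expression position by position, and they treat a parenthesized group found where a single end-node class is expected as repeated elements of that class.

```python
def parse(expression):
    """Parse a parenthesized expression into nested lists of stripped strings."""
    def parse_unit(position):
        items, token = [], ""
        while position < len(expression):
            char = expression[position]
            if char == "(":
                nested, position = parse_unit(position + 1)
                items.append(nested)
            elif char in ",)":
                if token.strip():
                    items.append(token.strip())
                token = ""
                if char == ")":
                    return items, position
            else:
                token += char
            position += 1
        return items, position
    return parse_unit(0)[0][0]

def classify(class_structure, data, found=None):
    """Pair each data element with the end-node class in the same position."""
    found = [] if found is None else found
    for class_item, data_item in zip(class_structure, data):
        if isinstance(class_item, list):
            classify(class_item, data_item, found)
        elif isinstance(data_item, list):
            found.extend((class_item, element) for element in data_item)  # repeated elements
        else:
            found.append((class_item, data_item))
    return found

classes = parse("((LAST, FIRST, MIDDLE), (STREET, CITY, STATE), TELEPHONE NUMBER)")
record = parse("((DOE, JOHN, JAMES), (203 ELM STREET, MONTEREY, CA.), (384-9363, 384-6214))")
membership = classify(classes, record)
```

Each resulting pair, such as ("LAST", "DOE"), states the class membership that the parentheses imply.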


3. System Utility


The utility of hierarchical classification in association with 


parenthesized expressions is realized by the user in three ways: 







1. The indexing techniques presented in section II require the 
user to conform to the language devised by the system designer for 
the retrieval of information. The user does not have the option of 
defining the indexing language that best suits his particular needs but 
must be satisfied with the indexing technique employed to best satisfy 
the needs of all users. In contrast, this system allows each user or 
user group to define his own indexing language by defining the class 
structure associated with the data he is most concerned with. In other 
words, the system will accept a mix of data allowing each user or user 
group to have his own retrieval system within a retrieval system. 
Each user or user group must define the class structure of his data. 
For example, a business-oriented system might consist of a data base 
partitioned into employee records, pay records, stock inventory, etc. 
Such a system would simultaneously serve the needs of many users. 


2. The user has the capability of entering the data structure in 
several ways to extract desired information. In the personnel record 
example, the user may retrieve complete records which satisfy cer- 
tain search keys, or retrieve only the names of personnel, or retrieve 
the phone number of a particular person, and so on. 

3. The classification scheme could serve as an intermediate 


language between the query processor and the retrieval system. 


B. RETRIEVAL PROCESS 
1. Query Format 

Queries are presented to the system utilizing the same for- 
mat as class expressions. The fully parenthesized expression contains 
search keys and blank positions which specify the information to be 
supplied to the user. The retrieval processor will fill in the blank 
positions with all of the information contained in the data base which 
satisfies the search keys. The expression must conform identically to 


the fully parenthesized expression used to represent the class structure. 


((DOE, JOHN, ___), (___, ___, ___), ___)





In the example above, the system will identify the class
membership of each search key and blank position through the classificatory
tree constructed from the class expression. A search is then instituted
for all records which contain an occurrence of "DOE" as a member of
the class "LAST" and "JOHN" as a member of the class "FIRST."
Information is extracted from those appropriate records to fill the
blank positions of the query. The user may broaden or narrow the
amount of information retrieved by the number and/or class of search
keys used in the query. A query containing only the search key
"CALIFORNIA" could produce a greater amount of information than a
query which has only one blank position.
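
The query format can be illustrated with a simplified Python sketch. It assumes records and queries are already held as nested lists with identical shape, uses `None` for a blank position, and scans every record in turn; the actual system works through its master index instead, so this shows only the matching rule. All names and the second sample record are hypothetical.

```python
BLANK = None  # stands for a blank position in the query

def flatten(expression):
    """List the leaf positions of a nested expression in left-to-right order."""
    leaves = []
    for item in expression:
        if isinstance(item, list):
            leaves.extend(flatten(item))
        else:
            leaves.append(item)
    return leaves

def answer(query, records):
    """Fill the blank positions of the query from every record matching its search keys."""
    results = []
    for record in records:
        pairs = list(zip(flatten(query), flatten(record)))
        if all(key is BLANK or key == value for key, value in pairs):
            results.append([value for key, value in pairs if key is BLANK])
    return results

records = [
    [["DOE", "JOHN", "JAMES"], ["203 ELM STREET", "MONTEREY", "CA."], "384-9363"],
    [["DOE", "JANE", "ANN"], ["12 OAK AVENUE", "MARINA", "CA."], "384-1111"],
]
query = [["DOE", "JOHN", BLANK], [BLANK, BLANK, BLANK], BLANK]
filled = answer(query, records)
```

Dropping the search key "JOHN" from the query would broaden the result to both records, which mirrors the remark above about narrowing or broadening a request.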

2. Boolean Expressions 

The ability to use Boolean expressions such as "and," "or,"
"not," etc., is desirable in any information retrieval package. However,
the degree to which Boolean expressions may be used is left to
the prerogative of the system designer in satisfying user needs. The
use of Boolean "and" is accepted by the retrieval processor in this
system and is identified by the ampersand:

((___, ___, ___), (___, MONTEREY & MARINA, CALIFORNIA), ___)

In this case, the names, street addresses, and phone numbers 
of all personnel who live in Monterey, California and Marina, 
California would be produced. 


Boolean "or" is not directly supported in this system, but a
similar effect is obtained with alphabetic and numeric range requests.





3. Alphabetic and Numeric Ranges


Alphabetic and numeric range requests are identified by the
colon. Examples of range requests are exhibited below.

((A:D, ___, ___), (___, MONTEREY, CALIFORNIA), ___)

The retrieval processor identifies an alphabetic range request for all
data elements which are members of the class "LAST" and which have
the first letter A, B, C, or D. The records of all personnel who live
in Monterey, California and whose last names begin with A through D
inclusive would be produced.

As shown immediately above, the system does not restrict the 
use of alphabetic or numeric ranges to single letters but any number of 
characters may be used and any number of range requests are possible 
within a single query. 

The above discussion is also true for numeric range requests. 
For example, the user desires complete records for all those personnel 


who have specific telephone exchanges: 


ee ee — , ), 9102394) 
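
A hedged sketch of how a single query term using the ampersand or the colon might be tested against a data element is given below in Python. The function `term_matches` is hypothetical. Two assumptions are made: the ampersand accepts an element matching either listed value (following the Monterey and Marina example, which returns residents of both cities), and a range bound is compared against only as many leading characters of the element as the bound supplies.

```python
def term_matches(term, value):
    """Test one data element against a query term that may use '&' or ':'."""
    if "&" in term:
        # Accept an element equal to any of the listed alternatives.
        return any(term_matches(part.strip(), value) for part in term.split("&"))
    if ":" in term:
        low, high = (bound.strip() for bound in term.split(":"))
        # Compare only as many leading characters as each bound supplies.
        return low <= value[:len(low)] and value[:len(high)] <= high
    return term == value

in_range = term_matches("A:D", "DOE")                        # last name begins with A through D
either_city = term_matches("MONTEREY & MARINA", "MARINA")    # one of the two listed cities
```

Numeric ranges work the same way under this sketch provided both bounds have the same number of digits, since the comparison is made character by character.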







VII. SYSTEM STRUCTURE


This section discusses the internal design of the general-purpose 
fact-retrieval system employing the data-structure technique previously 
explained. The system was implemented on the Naval Postgraduate 
School's IBM 360 Model 67 Computer and is an interactive system under 


control of the Cambridge Monitor System (CP/CMS) [Ref. 13]. 


A. DATA FILES

Data files are stored on punched cards and consist of the following 
three types: 

1. Format definition cards. These cards define the class 
structure for each universe of discourse to be included in the data 
base. An example of a format definition card is: 

EMPLOYEE RECORD (NAME, ADDRESS, AGE, CHILDREN)

2. Class definition cards. These cards further define the 
structure of the classes contained in the format definition. Examples 
of class definition cards are: 

NAME (LAST, FIRST) 
ADDRESS (STREET, CITY, STATE)

3. Data records. The data records contain the data elements 

associated with the universe of discourse and are fully parenthesized 


expressions. An example of a data record is: 


EMPLOYEE RECORD ((DOE, JOHN), (203 ELM STREET, MONTEREY, CA.), 
(48), (MARY, SALLY)) 







Format definitions, class definitions, and data records may also
be entered into the system via on-line terminal. For a large-scale
data base, the data records could be stored in unstructured form on a
back-up storage device such as magnetic tape. Structuring of records
would be accomplished under program control according to pre-stored
format and class definitions.


B. TREE-TYPE DATA STRUCTURES

A tree-type data structure is employed to represent the hierarch- 
ical classification of a universe of discourse. The tree-structuring 
process described later in this section employs data cells to represent 
nodes within a tree and the "chaining" technique to order the cells into 
tree structure form. 

1. Data Cells 

Data cells available to the tree-structuring processor consist 
of three fields. The description and function of each field is described 
below: 

a. The identifier field, referred to as "TOP," contains the
storage address (pointer) of the data or class entity which the data cell
represents.

b. The right link field, referred to as "RIGHT," contains a
pointer which is used to chain the data cell to another data cell on the
same level of the tree.

c. The down link field, referred to as "DOWN," contains
a pointer which is used to chain the data cell to another data cell
located in a lower level of the tree.

Figure 5 demonstrates the use
of data cells. A zero in a link field signifies "no link" or a null field.

2. Structuring Process

Empty data cells are constructed in core storage through
list structuring techniques and are stored in an area available to the
tree-structuring routine. The presence of a format definition card
initiates the structuring process. The format name (e.g., EMPLOYEE
RECORD) and the class names contained on the card are extracted
and moved into storage (a discussion of this process is deferred to a
later section). One cell for the format name plus one cell for each
class name contained on the card are retrieved and tree
structuring commences. The first cell in the tree structure is called
a "header" and serves to identify the format name of the tree. Each
of the classes contained in the format definition is assigned to a data
cell and the cells are chained together. Figure 6 shows the structure
representing the format definition:

EMPLOYEE RECORD (NAME, ADDRESS, AGE, CHILDREN)
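
The chaining of data cells can be sketched in Python. This is an assumption-laden illustration rather than the system's list-structuring code: object references stand in for storage addresses, `None` plays the role of the zero "no link" value, the TOP field holds the name itself instead of a pointer to it, and the header is assumed to reach the first class through its DOWN link. The names `DataCell`, `build_format_tree`, and `level` are hypothetical.

```python
from dataclasses import dataclass
from typing import Optional

@dataclass
class DataCell:
    top: str                             # stands in for the pointer to the class or data entity
    right: Optional["DataCell"] = None   # next cell on the same level (None = no link)
    down: Optional["DataCell"] = None    # first cell on the next lower level

def build_format_tree(format_name, class_names):
    """Create a header cell and chain one cell per class beneath it."""
    header = DataCell(format_name)
    previous = None
    for name in class_names:
        cell = DataCell(name)
        if previous is None:
            header.down = cell       # first class hangs below the header
        else:
            previous.right = cell    # later classes are chained to the right
        previous = cell
    return header

def level(cell):
    """Follow RIGHT links and list the TOP values found on one level."""
    names = []
    while cell is not None:
        names.append(cell.top)
        cell = cell.right
    return names

tree = build_format_tree("EMPLOYEE RECORD", ["NAME", "ADDRESS", "AGE", "CHILDREN"])
first_level = level(tree.down)
```

Five cells are used here, one for the format name and one for each of the four classes on the card.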

Before completing the discussion of tree structuring it is
important to note that class definitions throughout the various universes of
discourse in the data base must be consistent. That is to say, if the
class called "NAME" is defined as (LAST, FIRST) then every
occurrence of "NAME" must consist of the classes "LAST" and "FIRST."
If this is not done, confusion arises during the retrieval process when


[Figure 5: diagram of a data cell with the three fields TOP, RIGHT,
and DOWN.]

NOTE: The numbers in the TOP fields are sequence numbers.

Figure 5

Data Cell Composition





HEADER CELL

EMPLOYEE RECORD (NAME, ADDRESS, AGE, CHILDREN)

The numbers in the TOP fields correspond to:

1 EMPLOYEE RECORD
2 NAME
3 ADDRESS
4 AGE
5 CHILDREN

Figure 6

Tree Structure Composed of Data Cells





the processor attempts to identify the class memberships of data
elements. Therefore, as each format definition is read, a search is
conducted of all previously constructed trees to determine whether or
not each of the classes contained in the definition being processed
has been previously used. If a class has been previously used then
the tree structure representing the class is appended to the tree being
built. If a class has not been previously used then a class definition
card must be submitted to the tree-structuring processor.
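
The consistency requirement on class definitions can be illustrated with a short Python sketch. The text does not say how the system reacts to a conflicting definition, so rejecting it with an error is purely an assumption made here; the registry `defined_classes` and the function `define_class` are hypothetical names.

```python
defined_classes = {}   # class name -> its single, shared structure

def define_class(name, subclasses):
    """Record a class definition, rejecting one that conflicts with an earlier definition."""
    subclasses = tuple(subclasses)
    if name in defined_classes and defined_classes[name] != subclasses:
        raise ValueError(f"{name} is already defined as {defined_classes[name]}")
    defined_classes[name] = subclasses

define_class("NAME", ["LAST", "FIRST"])
define_class("NAME", ["LAST", "FIRST"])   # consistent reuse by a second format is accepted
```

Because only one structure per class is kept, a second format that uses "NAME" can share the existing definition instead of supplying its own.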

After the format definition card has been processed any class 
definition cards associated with the structure are processed. Figure 7 
contains a completed tree structure for: 

EMPLOYEE RECORD (NAME, ADDRESS, AGE, CHILDREN) 


NAME (LAST, FIRST) 


ADDRESS (STREET, CITY, STATE)


C. INDEX FILES

The system incorporates an index file, called the master index, 
which demonstrates many of the characteristics and advantages of an 
inverted index. The master index contains format names, class names, 
and data elements. Each entry in the index has a pointer associated 
with it which links the entry to a tree structure, data record, or 
further information concerning the entry. The retrieval process is 
always initiated at the master index since it is the agent which directs
the search for information in response to a user's query.


EMPLOYEE RECORD (NAME, ADDRESS, AGE, CHILDREN) 
NAME (LAST, FIRST) 
ADDRESS (STREET, CITY, STATE) 


The numbers in the TOP fields correspond to: 


 1 EMPLOYEE RECORD      6 LAST
 2 NAME                 7 FIRST
 3 ADDRESS              8 STREET
 4 AGE                  9 CITY
 5 CHILDREN            10 STATE

Figure 7

Tree Structure for the Format: "EMPLOYEE RECORD"





1. Characteristics of the Master Index 
Conceptually, the master index is a large matrix consisting 

of fixed-length records (matrix rows), each containing eight fields 
(matrix columns), as shown in Figure 8. The first four characters of 
format names, class names, and data elements are stored in the first 
four fields of the index. Entries which contain more than four char- 
acters are then stored in a sequential storage area reserved for 
variable-length records. The remaining four fields of each index 
record contain information concerning the type of entry (e.g., format 
name, class name, or data element), the sequential store address of 
the full character representation of the entry, if any, pointers to infor- 
mation-bearing data cells, and other information useful to the retrieval 
processor. 
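
The layout of a master index record can be sketched as follows. This Python fragment is a simplified illustration: padding short entries with blanks, using 1-based list positions as sequential store addresses, and leaving the last two pointer fields at zero are all assumptions, and the names `sequential_store` and `make_index_record` are hypothetical.

```python
sequential_store = []   # variable-length area for entries longer than four characters

def make_index_record(entry, entry_type):
    """Build an eight-field record: four character fields, a type field, three pointer fields."""
    characters = list(entry[:4].ljust(4))          # fields 1-4: first four characters
    full_text_address = 0                          # zero stands for "no link"
    if len(entry) > 4:
        sequential_store.append(entry)
        full_text_address = len(sequential_store)  # 1-based address of the full entry
    return characters + [entry_type, full_text_address, 0, 0]

long_entry = make_index_record("ADDRESS", "C")
short_entry = make_index_record("AGE", "L")
```

Keeping only four characters in the fixed-length record lets every index row have the same size, while the sequential store absorbs the variable-length remainder.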

2. Constructing the Master Index

The first record of the master index is reserved as a table 

of all format names contained in the data base. The first record con- 
tains the address of the first data cell (identical to the data cells used 
in tree structuring) in a chain of cells and each cell contains the 
address of a format name located in sequential storage. Through this 
record a user may quickly determine the partitioning of the data base. 
Figure 9 demonstrates the idea. 

Format names are entered in the index and linked to their
definitions which are located in sequential storage. Each of the
classes contained in the format definitions is also stored in the index.





COLUMN(S)
1-4 : First 4 characters of the entry
5   : Format No. if the entry is a format name
      "C" if the entry is a class name
      "L" if the entry is the lowest level class in a
      tree structure
      "D" if the entry is a data element
6   : Pointer to the full character representation in
      sequential store
7   : Pointer to associated chain of data cells if the
      entry is classified "L", otherwise pointer to
      sequential store
8   : Pointer to associated data cell in the tree structure
      if the entry is a class or format.
      Pointer to associated chain of data cells if the entry
      is a data element.

Figure 8

Representation of the Master Index





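[Figure 9: diagram not recoverable from the scan. It shows the reserved
master index record linked through a chain of data cells to the format
names EMPLOYEE RECORD, PAY RECORD, and STOCK INVENTORY in the
sequential store.]

Figure 9

Reserved Record in the Master Index for Format Names with
Associated Data Cells and Format Names in Sequential Store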





Associated with each class entry in the index is a string of data cells 
which contain two items of information concerning the class: 

a. The first field contains the number of the data record which, 
in turn, contains an occurrence of the class. (This information is 


added when the data records are read and is discussed later. ) 


b. The second field contains a number corresponding to the for- 
mat name which contains this class entry. 


A class may be used in any number of different format definitions
but its structure must be consistent in every occurrence. Therefore,
regardless of the number of format definitions which contain a given
class, there is only one index record for the class. The data cells
appended to the class entry provide the retrieval processor with data
such as the format definitions in which the class appears. Among
other things, information pertaining to the class entries provides the
retrieval processor with the capability of quickly abandoning a search
when a user requests information through a class which is not a
member of the format being queried.

Class definitions are processed in a manner very similar to for- 
mat definition processing. The class being defined is entered in the 
index and the definition is stored as read in the sequential store. The 
system returns the sequential store address and enters it in the index 
record. Appropriate data cells are appended to the index and the 
class structure is added to the classificatory tree. When the tree is 
completed, those classes which are end nodes in the classificatory
tree (e.g., LAST, FIRST, STREET, CITY, STATE, AGE, and
CHILDREN in EMPLOYEE RECORD) are identified and their index
entries are flagged. This is done to ensure that elements in the data
records are mapped onto the tree structure according to their proper
class membership.

As each data record is read into the system it is assigned a
unique number and placed in the sequential store. Each element within
the record is examined to determine its class membership and the
master index is searched to determine if the element was previously
entered by another data record. The possibility of a data element
appearing in more than one record exists if the data base contains
similar formats such as employee records and pay records. In
addition, a data element may be a member of more than one class such
as the occurrence of "JOHN" as a member of both classes "FIRST"
and "CHILDREN." It is highly desirable that there be only one entry
in the master index for those elements which occur more than once.
Unique entries in the index guarantee that when an item is located in
the index, the search process is complete and successful. Additionally,
the need for combined search plans is eliminated. Specific record
and class membership information for each data element entered in
the index is resolved by appending data cells to the master index entry.
The data cells contain the record number(s) from which the element
was extracted and its class membership(s). Assuming that a data
element occurs several times in the data base, the master index would
still contain only one record for the element. The record contains all
of the information pertinent to the retrieval process. The technique,
relevant to both class and data entries, results in two important
savings:

1. A significant reduction of storage space is realized (if an
element occurs several times) since multiple entries in the master
index require more storage space than a single record and its
associated data cells.

2. A significant reduction in search time is realized since multiple
entries require the retrieval processor to conduct a full-file search
each time it enters the master index.
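
The single-entry rule can be sketched in Python. A dictionary stands in for the master index and a list of (record number, class) pairs stands in for the chain of appended data cells; both substitutions, the record numbers, and the function names are assumptions made for illustration.

```python
master_index = {}   # one entry per distinct data element

def enter_element(element, record_number, class_name):
    """Append a (record number, class) cell to the element's single index entry."""
    master_index.setdefault(element, []).append((record_number, class_name))

def records_with(element, class_name):
    """Return the record numbers in which element occurs as a member of class_name."""
    cells = master_index.get(element, [])
    return [number for number, member_of in cells if member_of == class_name]

enter_element("DOE", 1, "LAST")
enter_element("JOHN", 1, "FIRST")
enter_element("JOHN", 2, "CHILDREN")   # same element, different record and class
enter_element("DOE", 3, "LAST")

johns_by_first_name = records_with("JOHN", "FIRST")
```

Four occurrences produce only two index entries, and one lookup of "JOHN" yields every record and class in which it appears.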

3. Data Record Table

Cells appended to each data element stored in the master
index do not contain the sequential store addresses of the records
from which the data elements were extracted. This information is
stored separately in a table referred to as a data record table. The
data record table augments the information contained in the master
index and is composed of fixed-length records as shown in Figure 10.
Each table record consists of three fields which contain:

a. The unique data record number. 

b. Format membership of the data record. 

c. Sequential store address of the data record. 

The data record table serves two functions: 

a. The retrieval processor bypasses the master index and directly 
enters the data record table to satisfy requests for all data records 
which are members of a particular universe of discourse. 

b. The table is also utilized for queries other than those which
request "all data records." The retrieval processor searches the
master index to determine the data records which satisfy a user's





DATA RECORD TABLE          SEQUENTIAL STORE

[Table rows are not recoverable from the scan. The sequential store
entries shown in the figure are:]

((DOE, JOHN), (80 WHITNEY,...
((...), (32 CAPITAL...
((041305416), (WRENCH),...
((DOE, JOHN), (094-63-3152),...
((EA 3733, CONN), (BUICK,...

FORMAT NUMBER    FORMAT NAME
      1          EMPLOYEE RECORD
      2          CAR REGISTRATION
      3          PAY RECORD
      4          STOCK INVENTORY

COLUMN
1 : Unique record number
2 : Format membership of the data record
3 : Pointer to data record in sequential store

Figure 10

Representation of the Data Record Table





request. Then the processor enters the data record table and extracts 
the sequential store addresses of the records. The sequential store 
addresses are passed to the "output" section of the retrieval processor. 
The information contained in the data record table is tabulated 
separately from the master index to achieve savings in storage space 
and response time. Storage savings are realized since the addresses 
of data records in the sequential store are contained only in the data 
record table and are not replicated in the master index for each class 
and data element. System response time is reduced for queries that 
request all data records of a particular universe of discourse since 
the data record table was designed primarily to expedite this type of 
request. The retrieval processor extracts all of the necessary data 
record addresses in one access of the table. The amount of searching 
within the table is minimal. 
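
The two functions of the data record table can be illustrated with a minimal Python sketch. It is not the thesis program: the names TableRecord, addresses_for_format, and addresses_for_records are hypothetical, and the store addresses are invented for illustration. Only the three-field record layout and the format numbers are taken from Figure 10.

```python
from typing import NamedTuple


class TableRecord(NamedTuple):
    record_number: int   # unique data record number
    format_number: int   # format membership of the data record
    store_address: int   # address of the record in the sequential store


# Hypothetical table contents; the addresses are made up for this sketch.
DATA_RECORD_TABLE = [
    TableRecord(1, 1, 100),   # format 1: EMPLOYEE RECORD
    TableRecord(2, 1, 140),
    TableRecord(3, 4, 180),   # format 4: STOCK INVENTORY
    TableRecord(4, 3, 210),   # format 3: PAY RECORD
    TableRecord(5, 2, 250),   # format 2: CAR REGISTRATION
]


def addresses_for_format(table, format_number):
    """Function (a): answer an 'all data records' request in one pass
    over the table, without entering the master index."""
    return [r.store_address for r in table if r.format_number == format_number]


def addresses_for_records(table, record_numbers):
    """Function (b): translate record numbers found in the master index
    into sequential store addresses."""
    wanted = set(record_numbers)
    return [r.store_address for r in table if r.record_number in wanted]
```

Because the addresses live only in this table, the master index carries record numbers alone, which is the storage saving described above.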


C. INFORMATION FILE 

The "sequential store" is the system information file, or data 
base. It contains the data records, format definitions, class definitions, 
and the full character representation of those entries in the master 
index consisting of more than four characters. Figure 11 shows the 
sequential store and its relationship to the master index and the data 
record table. 

The information file is resident in main core storage. The 
variable-length records of this file are sequentially ordered. System 
information files are not normally stored in main core unless they are 







[Diagram with blocks labeled MASTER INDEX, DATA RECORD TABLE, and SEQUENTIAL STORE]

Figure 11

Relationship between the Master Index, Data Record Table, and Sequential Store







relatively small (which is the case here). However, it is imperative 
that such a file be resident on a direct access storage device in order 
to provide satisfactory system response time. 


D. RETRIEVAL PROCESSOR 
The retrieval processor is divided into three operations. The 
identification operation determines the type of query posed by the 
user; the search operation determines the data record numbers which 
satisfy the user's request; the output operation retrieves the resultant 
data records from the sequential store and prints them at the terminal. 
Additionally, special messages are output to the user: error messages 
which warn him of invalid queries, and messages which notify him of 
unsatisfied queries. 
1. Query Types 
The IR system designer strives to achieve total utility of the 
system by providing the user with a powerful retrieval language. 
Utility of the data structure used in this system is realized by the 
various types of queries available to the user for extracting informa- 
tion from the data base. There are four major types of queries avail- 
able to the user. 
a. Determining Data Base Partitions. 
As previously discussed, the data base may be partitioned 
to allow a mix of unrelated information by defining the class structure 
of each universe of discourse in the data base. A user who is 







unfamiliar with the data base partitions (format names) may easily 
retrieve this information by submitting a special type of query. The 
format of the query is simple and consists of the single search key: 
"CLASS." This is translated by the retrieval processor as: "Output 
the names of all formats contained in the data base." Search of the 
master index is then centered at the first record of the index and its 
associated chain of data cells which contain the sequential store ad- 
dresses of the format names. All format names contained in the data 
base are output to the user. 
QUERY: CLASS 
RESPONSE: EMPLOYEE RECORD 
          PAY RECORD 


b. Determining Format and Class Definitions. 

In order to extract data from a specific universe of discourse, 
the user must be provided with its class structure. The class structure 
determines the format for data record requests. Queries of format 
and class definitions must contain, as a search key, the format name 
or class name to be defined. The search processor enters the master 
index to locate the format name or class name, extracts the address 
of its definition located in the sequential store, and outputs the 
definition directly at the terminal. 




QUERY: EMPLOYEE RECORD 
RESPONSE: (NAME, ADDRESS, AGE, CHILDREN) 

QUERY: NAME 
RESPONSE: (LAST, FIRST) 

QUERY: AGE 
RESPONSE: NO DESCENDANTS 
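
The definition queries above amount to a lookup from a format or class name to its immediate descendants. The following Python sketch illustrates that behavior under stated assumptions: the dictionary stands in for the master index and sequential store, the function name define is hypothetical, the descendants listed for ADDRESS are assumed for illustration, and returning "INVALID QUERY" for an unknown name is likewise an assumption.

```python
# Immediate descendants of each format or class name.
# Names absent from this mapping but listed as a descendant are end nodes.
DEFINITIONS = {
    "EMPLOYEE RECORD": ["NAME", "ADDRESS", "AGE", "CHILDREN"],
    "NAME": ["LAST", "FIRST"],
    "ADDRESS": ["STREET", "CITY", "STATE"],  # assumed for illustration
}

# Every name the system knows: defined names plus all their descendants.
KNOWN_NAMES = set(DEFINITIONS) | {
    child for children in DEFINITIONS.values() for child in children
}


def define(name):
    """Return the response text for a format or class definition query."""
    if name not in KNOWN_NAMES:
        return "INVALID QUERY"          # assumed wording for unknown names
    children = DEFINITIONS.get(name)
    if not children:
        return "NO DESCENDANTS"         # the name is an end node of the tree
    return "(" + ", ".join(children) + ")"
```

Calling define("NAME") yields "(LAST, FIRST)", and define("AGE") yields "NO DESCENDANTS", matching the responses shown above.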


c. Data Element Retrieval. 

One asset of the data structure concept is that it allows the 
user to extract single data elements from the data base which are 
members of a particular class and format, or members of a particu- 
lar class irrespective of the format membership. Since data elements 
are mapped onto the end nodes of their respective tree structures, the 
user must use the lowest level classes of the structure as search keys. 
Failure to do so prompts the retrieval processor to output corrective 
information to the user. The hyphens in the queries below indicate to 
the retrieval processor that the expressions are queries and not for- 
mat definitions. The processor could identify the expression by 
searching the master index for an occurrence of "EMPLOYEE RECORD." 
A successful search would indicate that a format definition already 
existed in the system. However, use of the hyphen is a simpler and 







faster method for positively identifying the type of expression submit- 
ted to the system. 
QUERY: EMPLOYEE RECORD (NAME, -) 
RESPONSE: INVALID QUERY - 
          DETERMINE DESCENDANTS OF: NAME 
          USE DESCENDANTS AS KEYWORDS 
QUERY: EMPLOYEE RECORD (LAST, -) 
RESPONSE: BROWN 
          SMITH 
          THOMPSON 

To answer the above query, a search is conducted in the master 
index for all data elements which are members of the class "LAST" 
and are members of the format "EMPLOYEE RECORD." This infor- 
mation is contained in the data cells appended to each data entry in the 
index. Elements which satisfy the query are taken directly from the 
master index and output at the terminal. 

In the query below, the hyphen is used to differentiate between a 
query and a class definition statement. All data elements which are 
members of the class "LAST" are output irrespective of format 
membership. The format membership fields of the data cells are 
ignored during the search of the master index. 

QUERY: LAST (-) 
RESPONSE: BROWN 
          CHAMBERS 
          COLTEE 
          DOE 
          SMITH 
          THOMPSON 
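
Both forms of data element retrieval can be sketched as a scan of the master index in which the format membership field of each data cell is either tested or ignored. This is an illustrative Python sketch, not the thesis implementation: DataCell, MASTER_INDEX, and elements_of_class are hypothetical names, and the assignment of names to formats and record numbers is invented so that the output matches the two responses above.

```python
from typing import NamedTuple


class DataCell(NamedTuple):
    class_name: str      # class membership of the data element
    format_name: str     # format membership of the data element
    record_number: int   # data record that contains the element


# Hypothetical master index: data element -> chain of appended data cells.
MASTER_INDEX = {
    "BROWN": [DataCell("LAST", "EMPLOYEE RECORD", 1)],
    "CHAMBERS": [DataCell("LAST", "PAY RECORD", 2)],
    "COLTEE": [DataCell("LAST", "PAY RECORD", 3)],
    "DOE": [DataCell("LAST", "PAY RECORD", 4)],
    "JOHN": [DataCell("FIRST", "PAY RECORD", 4)],
    "SMITH": [DataCell("LAST", "EMPLOYEE RECORD", 5)],
    "THOMPSON": [DataCell("LAST", "EMPLOYEE RECORD", 6)],
}


def elements_of_class(index, class_name, format_name=None):
    """Return the data elements belonging to class_name.

    When format_name is None the format membership field of every
    data cell is ignored, as in the query LAST (-).
    """
    hits = []
    for element, cells in index.items():
        for cell in cells:
            if cell.class_name != class_name:
                continue
            if format_name is not None and cell.format_name != format_name:
                continue
            hits.append(element)
            break  # one matching cell is enough to report the element
    return sorted(hits)
```

The elements are returned straight from the index; no access to the data record table or the sequential store is needed for this type of query.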







d. Data Record Retrieval. 

Data record retrieval is the most valuable and would be the 
most frequently used type of request available to the system user. 
It extracts complete data records which satisfy the search keys 
contained in the query. To retrieve data records, the 
queries contain data elements as search keys and may contain Boolean 
"AND," alphabetic and/or numeric ranges, or any combination thereof. 
The query format is a fully parenthesized expression as shown in 
previous sections. Search keys are positioned in the expression with 
respect to class membership, and hyphens are inserted in those positions 
for which information is requested. Any variation from the properly 
parenthesized expression prompts error messages from the retrieval 
processor to the user. 
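
With the four query types introduced, the identification operation can be sketched in Python as follows. This is a hedged illustration only. The function name identify and the returned labels are hypothetical. The hyphen test follows the text, while the rule that a doubly nested opening parenthesis marks a data record request is an assumption inferred from the example queries rather than a rule stated in the thesis.

```python
import re


def identify(expression):
    """Classify an expression submitted at the terminal."""
    text = expression.strip()
    if text == "CLASS":
        return "data base partitions"
    if "(" not in text:
        # A bare format or class name asks for its definition.
        return "format or class definition"
    body = text[text.index("("):]
    # A hyphen counts only when it stands alone in a field, so a data
    # element such as 094-63-3152 is not mistaken for a blank position.
    fields = [field.strip() for field in re.split(r"[(),]", body)]
    if "-" not in fields:
        return "definition statement"
    # Assumption: record queries are fully parenthesized and open with "((".
    if body.replace(" ", "").startswith("(("):
        return "data record retrieval"
    return "data element retrieval"
```

For example, identify("EMPLOYEE RECORD (LAST, -)") is classed as data element retrieval, while the same expression without the hyphen is treated as a definition statement.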

The retrieval process for the query listed below is explained in 
the following paragraphs: 

EMPLOYEE RECORD ((DOE,-),(-,-,CA),(-),(-)) 

The format name appearing at the beginning of the query expres- 
sion informs the retrieval processor of the universe of discourse in 
which the user is interested. The processor then traverses the tree 
structure for "EMPLOYEE RECORD" to determine the lowest level 
classes in the tree. This information, in conjunction with the proper 
use of parentheses in the query expression, allows the processor to 
identify the class memberships of the search keys contained in the 
query. The user is notified whenever the processor is unable to find a 







search key in the master index. In this case, the processor attempts 
to recover data which satisfies the remaining search keys. Similar 
action takes place when the processor encounters a search key which 
is not a member of the class specified in the query, or if a search key 
is not a member of the format specified in the query. Additionally, 
the user is notified whenever the query is improperly formatted. 
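
The way position and parentheses determine class membership can be illustrated with a short Python sketch. It is an assumption-laden illustration, not the thesis code: the names EMPLOYEE_RECORD, split_top_level, strip_parens, and search_keys are hypothetical, the descendants of ADDRESS are assumed, and the wording of the error messages is invented.

```python
# Class structure of the format, as (class, lowest-level descendants).
EMPLOYEE_RECORD = [
    ("NAME", ["LAST", "FIRST"]),
    ("ADDRESS", ["STREET", "CITY", "STATE"]),  # assumed for illustration
    ("AGE", []),
    ("CHILDREN", []),
]


def split_top_level(text):
    """Split on commas that are not nested inside parentheses."""
    parts, current, depth = [], [], 0
    for ch in text:
        if ch == "(":
            depth += 1
        elif ch == ")":
            depth -= 1
            if depth < 0:
                raise ValueError("INVALID QUERY: unbalanced parentheses")
        if ch == "," and depth == 0:
            parts.append("".join(current).strip())
            current = []
        else:
            current.append(ch)
    if depth != 0:
        raise ValueError("INVALID QUERY: unbalanced parentheses")
    parts.append("".join(current).strip())
    return parts


def strip_parens(text):
    """Remove one enclosing pair of parentheses."""
    if not (text.startswith("(") and text.endswith(")")):
        raise ValueError("INVALID QUERY: expected a parenthesized group")
    return text[1:-1]


def search_keys(expression, tree):
    """Map each non-hyphen field to its lowest-level class by position."""
    groups = split_top_level(strip_parens(expression.strip()))
    if len(groups) != len(tree):
        raise ValueError("INVALID QUERY: wrong number of class positions")
    keys = {}
    for group, (class_name, children) in zip(groups, tree):
        fields = split_top_level(strip_parens(group))
        leaves = children or [class_name]
        if len(fields) != len(leaves):
            raise ValueError("INVALID QUERY: wrong number of keyword positions")
        for leaf, field in zip(leaves, fields):
            if field != "-":          # a hyphen marks requested information
                keys[leaf] = field
    return keys
```

For an expression such as ((DOE,-),(-,-,CA),(-),(-)), the sketch pairs DOE with the class LAST and CA with the class STATE; every hyphenated position is left for the output operation to fill.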

Each search key in the query is processed sequentially. The 
retrieval processor searches the master index for an occurrence of 
each key. Record numbers which contain an occurrence of the search 
key are extracted and stored in a list. After all search keys have been 
processed, the retrieval processor "ANDs" the record numbers in the 
list to determine which records satisfy the query. For example, 
assuming that two key words are used and record numbers 5, 32, and 
67 satisfy the first key word, and record numbers 32 and 67 satisfy 
the second key word, records 32 and 67 are output to the user. Re- 
cord numbers which satisfy the query are passed to the "output" sec- 
tion of the retrieval processor which retrieves the sequential store 
addresses of the records from the data record table and prints the 
records at the terminal. 
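
The "AND" step amounts to intersecting the record number lists collected for the individual search keys. A minimal Python sketch follows, using the record numbers from the example above. The function name and_record_lists is hypothetical, and the sketch does not model the case of a search key that is missing from the master index.

```python
from functools import reduce


def and_record_lists(lists_per_key):
    """Return the record numbers that occur in every per-key list."""
    if not lists_per_key:
        return []
    common = reduce(
        lambda left, right: left & right,
        (set(numbers) for numbers in lists_per_key),
    )
    return sorted(common)


first_key = [5, 32, 67]    # records satisfying the first key word
second_key = [32, 67]      # records satisfying the second key word
print(and_record_lists([first_key, second_key]))   # prints [32, 67]
```

The surviving record numbers are what the output section would then translate into sequential store addresses through the data record table.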

A user has the ability to immediately examine the results of his 
query since the system is interactive. The results of one query may 
prompt the user to submit another request, either broadening or 
narrowing the request through judicious use of search keys. In any 
case, the user is guaranteed that if the information that he seeks is 







contained in the data base, he will have quick and easy access to it. 
Appendix A contains a sample run of the fact-retrieval system and 
demonstrates all of the queries available to a user and the system 
responses. 


E. ALTERING THE DATA BASE 
1. Changes and Deletions 
Due to the experimental nature of the system, no utility 
routines have been provided for deleting records or making changes 
to existing records. Alterations are accomplished by manually 
changing the card images in the data files. 
2. Additions 
The addition of data records to existing data sets or the sub- 
mission of new universes of discourse is accomplished most easily 
without special utility routines. This feature is inherently built into 
the system through the data structuring technique. Addition of a new 
universe of discourse is accomplished by submitting format and class 
definitions, and associated data records either on-line through the 
terminal (automatically) or off-line with card images (manually). New 
data records may also be added to existing data files automatically or 
manually. 







VIII. CONCLUSIONS 


Characteristics of the data structuring concept as used in a 
general-purpose fact-retrieval system have been discussed throughout 
the preceding sections. These concepts are summarized here. 

The data structuring technique encompasses the concept of 
hierarchical classification, which is the most widely used method of 
indexing. Hierarchical classification of data is a relatively simple 
technique to use but possesses the power to divide and subdivide a 
universe of discourse into more specific subjects. Additionally, 
hierarchical structures may be created to include a domain of subjects. 
This is advantageous for use in a fact-retrieval system, as previously 
demonstrated, by providing a mix of structures in a single data base. 
Therefore, users with differing interests are provided simultaneous 
access to a single system since each is provided a "personal" retrieval 
system within a larger retrieval system. In addition, the hierarchical 
structure provides a user with multiple avenues of access into his 
information file. 

Parenthesized expressions serve as an intermediate language 
between the query processor and the information retrieval system. 
The query processor is able to determine the class memberships of 
elements within an expression by examination of the parenthesized 
form. It is apparent, however, that the use of parenthesized expres- 
sions is cumbersome and demanding since misplacing parentheses is 







easy to do and causes loss of meaning of the expression. On the other 
hand, it can be argued that the technique of parenthesizing expressions 
is powerful and an equally powerful substitute is difficult to theorize. 







APPENDIX A 


[Sample run of the fact-retrieval system, reproduced in the original as 
rotated terminal listings. The listings demonstrate, in order:] 

1. Request all format names (data base partitions). 

2. Request specific format and class definitions. 

3. Request all elements which are members of a specific class. 

4. Request all data records which satisfy specific keywords. 

5. Addition of a new format name (data set) to the data base. 

6. Queries pertaining to the new format name. 





COMPUTER PROGRAM 


— ” ® 
© a + p- VY) 
uy > Gi 2a 
— + ® YM Wo 
© = WY I Ww °e 
< ~~ ~ ar WOOD 
= ad a Oda Ww 
cc) =) <[ a e [=A ZN 
© LL -_ ce CW Y 
ae Y WY O OOdw ow 
Co — ~ ~ WY) LE IW 
—_— © ae “~ «OO or @ 
oO Oo o~ © =) <ZWwOd 
—~m © x - WY) LL es = = OL 
rt at OO ome - ~N ~- © aeaot Ww 
ONTARC = ~ CI n 2 met oO I OC 
O wr > CL VY) ~——S <I OoOoadt<a 
we SYOW ~ am SS oe WOT = 
Zee e>—lUS wet ON] VY) Lee) 
=SsNnOenr> D zemelen rel ae 
Ow «xD ee ©) Ske FH WW MO JW <f 
Owl "OS OxK™Ne© OO f- ee pe 
Fe eM Z2COMY & ZOWenO Z@ Sd GG 
am & &O ee OKC —Iezr HnWe D SLE ty & 
Nat O UV S&S WwW Om wwe IN © Lu MLL 
OTOH OO aN Oe oe) rMuwW 
OAS ORM & eZ De NYOWM pwd TZ 
we Ne 9) OL CO Os C—O e a wad Fe bet OC 
~ pe See Or oe Creat oS Li. LU O 
NM DCDeR=z ZAM e« RCO ute K <f GOOOWO 
| QOM Wms ew eT YOrRrSTOOWNM - pew SLU 
< aS OTDPRNOCOWLYO ewk& &- & =O ate 
— OW ew &O elt & & & ew NN Lu OrFN2Z 
mO eemm commie LU OY Lm mew Moe ene CY C7 Lu 


NK Ose SsOmOTINOSE IMM —-Ost WW ke 


HOMMKHMENOMM >> «cate owwe co SA “~ x RRR tr 

CO DON Ow I INE FF OONN Ne > ~ a. ge Nat a 

Umer Ze Ow OT e& eZOYdTOZAe xXjct Pras << << YOR SOY 

ONdeeyYWe ON OTOD ewe COOde & <[ oe ee | ZeOOOct «KK KK ~ 
Wty OonarvrOwvwOZNZt New we en eau) Og ~a YeIUAAW UOYNINORFO z 

RmrFAWOOW ZW ke Mw Y ~~ ct Oo 74 Ol 1 2 Onuw ww uu i © <q 
ZUDArF SUYMNNNWYN We Xe > UY) II Waste OPTOWKOWW mane s yl 

— WOYD>KOOwO UTWw mt AJ ~ eon eh #9 ei COZTORVMTuOw “oO 
22222222232 T0eRH 2 Ze ewe oN ce wd el es eed oO mt Qa pe oc pe (i) ee em me & ee ee w= | 

rFODOODODOOOORRLULNWWN I Nee ete Ree WoW PUL ee RL ee CR RL ONNWN)D tt 

Pant bat bd et ped Det bed ed eg ARN JO Oe Ly Il I} oC pet pees Te ee ee Tes te ee LE ee OY) LL fre ee ee ew er ree st) ot] IL ts 
ONMNMNNNNAMNMNZ 22222440 Il Swett aene aad oOo ew 2 KK KE ROO KY Ob KK KEK OO meee — 
MBP ZS2ZZZZZOO0O0O000>> &© we mI tl WM ORD ete LULU ee Ome et QW TUL RMS INO OT 
SWI WWW SSS SS Re IN NN ee ee Ni On eee one ee meee QDADAADOO www ww Sw ONO CO 
ASSESS STS TTS TST TTDI DR NR Oat TN OW tt DD raza 2 2ZZZ2z ANMNZZTOWZZZZZ2Z2200C0000 lil tea 


= et ee St et et tH OO OOOOCORASMIES>P WSN IAW rR ZI ZOO CK OOO RM OMOOCORWWRHDH AY ett ee ee LU LL LW Ls Le Ou | 
MOOOOOOOOCOOCOOWWOWO Ie IOAINCOORPWILEULMODOOVCOAZOZOOCOLWOYCS Zr aIESTTTTETENNNNNNNAMNISY 


ot | | 
retO rm 0 TOU 
NIA NN N rt 


C----- 
C----—- 


' 
| 
| 
| 
O 


C--~---- 
C----_- 


| 
| 
| 
} 
O 


76 





THE 


ED,ENTRIES 


NO DATA 
AND 


’ 
A 
R 
’ 


Y 


’ 
U 
E 


ECORD 
TIONS 
TRUCT 

TABL 


R 

I 

5 

D 
MISSING HYPHEN® } 


LL 
z= © 
_ be 
aerate) ~ rd 
xe we — 
Lt — 
he ~ - 
LLj LL ow me 
Eyice ee LL 
= OC ae 
© WwW e <I —_— 
os Oe CO cD 
<I OC =) QZ 
e » — eS 
pale Y = Y) 
<i J Sh ee) 
fe S EC WOW 
Ore = © Ov ea ets) 
Oto wv . oO c= Wd IO 
an) ~ @ (ty — [ed(e a 
= Cy, 27 LL =< oe: 
Ovw il ® N oO tmnt L/? 
—aeo = = amt) bem — ~ Ow 
Se) ~ N + | Om wr co Ze 
Ozat = oe YCXroO eS LL -t <I LL 
Oe J = — b= -— ©) za uy G — 
<I~ ~ox WW — ke © LW LL © 
—eUOm ww ae) a6. © —_ <{ = b— oS CO 
ar ox is wou — — = O a ~ 
eh LU NO il ie re © e mee ©) © rk) ws 
SS ws rt OE Jor fam ome WA CY Ow Ca - 
WwoOtiwW~ “OO OOM Ou OF - OW © 
_ - — - © andq< ee WN WwW Mw =~ ee o) 
~ZNVU= mM O bk Ow, WO = oo e oO 
LL ot LL EC — J mem) elu OM <— <t Ww oS 
— NYO Saco OO e 8 -@)'e) = — LL) ~ 
YOWwW< oO Coe Voce Oe OoO= oO —Y wr O © 
Sim 2 = || CO ee e oll OO Sweet o me ee) eS 
ayvyY~ *OFUO0— =0C CI e ° ~~ ag ~~ Li co Oo 
Nel eWWOwc eCOYM 0 or Oe NW Ne & e — LL) — 
JIS &O = rom on + TO Oro e—_— Om — ~ 
HAMMW INNO Qs YY NW -— t- at St LU <f =<. 
Wu i eso ew ed eee YY) || ee we LL e©eOm ct tl thw ae Li 
Some SONS OPN YY YD fe = ST Gee SY YD =e eQ 
OO me YOWO~Kr, ~YO CMYK ZOOOXY FAZD ~~. Dawa =e. — 
Peel TW {QO OOOCOOCOCOsOO WOUdt= -Oaes~OesHivO 
CNYESROrFWODTRFOST MSEEBRr+¢YVRrROAYUDO SSereTzSzriwe a) 
Re CDeot Ode ae www 2M Mww ~OU!NO ~wDD~Z™7mFA ot 
QOOOLW LSX<WOOWLWOLIULILO HW HLMmlOWWOm OL WOWO IiwWwa <I 
Cn OT et me LS LO tO rt OS OT DMO MT ZO WO 


oe 4 


it 
= 11on o@ = , NT (eo fo 
NN ea 9) mt ist to 
wn an 
| | 


11 
1 , 1 
rid 1 | 
ils 11 
il | 
Ome CoC tC) 


FOR THIS FORMAT. 


E 
? 
S 
X 
GOTO 251 
GO TO 56 
GO TO 71 


Bacr | 
Oe 57 
ORM 
EX ( 

=0 


LN <i 
Ce = 
—~2 WW 
om Y) 
ToO~ a— 
Oex CO +— w= CO 
COWOYIZMNNOXK e& 
MWOOOID em ila 


).NE.0O) 


NEXTC(R) 
8 
R 


~ Ih 


BFDONZ ES = STeDO~S —m™ erNOWCZWW=-C~ 
Mw Osea > A Od oe aS Nes WOW COYSXK 
eY WOO OTtawatO i Z=zOe te wel 


YN OWD INH aO~EErEZYIUAY rR SEeUr TO 
SWUOOG~ ee He TEI~O YN DweuK ZI 
XT eWCWLOYLWILYOOZ Ii SLZORe OS IwLIWiLeSS 
fp et OS Yet OS et Nee SLL ZT ZO KIL OT OlL et Zee 
1 te 4 <x 

rit On Oo 0 } 
pir IAL a byw 
ee a N | 
lisfishs let 1 
UOUUO WO 


cd 





= VY) LU 
Co a wo 
LW a= 
VY) — Cy} 
om LE) 
LL ae 
Pas e c — = =~ 
| > UL ® © © 
cy VY) LW Lu >< 8 © 
oe) LU e VY) QOaw wWw © — 
LU VY) LW <I wad LW = 
ce - Lu = >~TZNe © 
eG) Za cw W ened fm at a OC — 
= — — Y ~ mm C5 
VY) — We WW © 
Cy o VY ©OcwW e2- ~ cS 
ze lw <{ or =O Oe a. 
8 x = = > wLWMLW © am 
— © ew tL Wiw<t2O ® = 
= _ © Cee Co a uw 
+ us Y Li. LL Gt te Ww ° © 
~ aa ae ete Gl) PENG) e CF ml 
ba LU Ww -— zZeTOwWw = Lu 
©) Ee =. LL Wee YW e © 
b= LL) <{ = ox Wwiar® >Od oF om — 
oS Pr OC OW 2 J<t = me = 
> ~~ = © Yes DMO =) a ap) 
CO <{ <{ Ee or TLNOZ eo = 
a — = = Ww I WU) H<f Ee <i O - co 
~ cr or — OQ QO= > e <{ <~ ~~ 
GC) = 7) © J <{ YW wwoa mice 9 za 
_ <{ LU LL Ww LWW DD) > b= of LU CS (es) fod 2S <{ 
as) — WO a IoyZake ’ = wn = 
Lz) mM — LU WwW = ww << om <[ e ome mM 
ap) ° Om a aw & LWW Ca) nae oo LN <{ a 
Lis O — — peer LL Te CO” WO I= oO Bs O LL) 
-~ a = wef TY) LU ad a = — Ze 
© 0 pe =) Ow < © OZan WW -—O—= O 8 
e -_ om Jem bey oom pe SE LU CO =— xa WwW ww e 8 mii = ( © ~ 
om CY ~ <{ A). oo Wu Q Ee WwW Ona age, COS — ° C5 ee 
COW) ~ Onl~ Ye Waa cH i il — YW <a{_ Ode WwW =O & os 
~ = = CONS Y WeZse YC Lomo cf —~ IZ LL bet LL Mat! eso DOD WwW -~ — > 
te ate -_—7 = = a sD <I Fe, «D <I OW oO je — — VY) eon —— C3: Ss) 3 ro =o 
w= ZZ YN — em ©) —_— 2 eYWO j— > ol e Wi N<iN-— al Ww YX qt — We ~— Il 
x<OW WOW I xX Aye Y mei <f S) met et eS md et et COLL ed OWMNUWN bl<—-_- SZ Ww i — ome] <2 
CO) LL) mee ner mn ee ff me Ww Wet HW Why OFZ Wh WWOtae-ASZ= Ae lZzaqaeaa ¥Y¥YY ~~ ~ em = OO MW Wi 
ZxXM UKE +0 +21 COM ame te et Ott ee OK KKK OD AazOAOZ wae + DD —( ma > =WY ++-+2oO 
mt LL ee LL et ee LU ee et FLU ee IL ot YY DD W Oe Fo) lL efesOnO a tee 
eee ete eR Se NSS TSE Oe NEN) ODO NE OOD Oe et = OO ZW TODO OWE Say UONMRFK +t cate URS a eM 


Hi = I] <x WOWWH I~ i KA OWNS N RK Ss OI SO he Za Ione Ls I EO Wil | CW) Www AM DOL Ole 
OPS wSOWS Sw Se YO SW OW SW OW OW We ST OHO LTOMVOOWTOWW es MOW OC MEL WW eb et SLL OO 
OU ROO ZOCMNVS Ss SOS Z”NNQNNNZANNOQNNS ZS HSOKSzRKOLOLADODWZsODDODNL |S ee SN OO ZO es el EVO 


| ried 
aN oOo wm ac Oo ~-O 


| MW Ore © Va) (Dy uy 
ee a ae Oo SS 


oOo ae Ov Ov © © 
wae r= 


N 
© 


C-~---- 


' | 
rtf 
td 
rtd 
OO’ 


C----- 
C---- 


| 
| 
| 
! 
O 


C----- 
C----- 


» 


® 
ome e 
ES LL) Y 
_ a Ox 
_ -_ rs J 
=< — @ 
b= Lu pang LL 
LL ae aL. 
Lu al — @ 
@ LL 
° a, LLL 
KO be O- 
<I ow = 
i Y) YY) 
Ow e Y UV ~ <I x< 
ieee wy YM = —_ LL} 
- <I - WOO 
LL) fame — eS z 
ok < © ee) b= ~~ S) mmet 
jem LL} = — oO eS 
_ a uy ce) ©) = Y) 
an) ee =) Ww i eS) O WO 
= bem LL} oo = Za Co Cs UI 
Ss © SY) - © uw = wm CO Oe 
9 b= Y) >< — u zt & ~ (Se 
th ®) IZ — ey <I o« O-— Aw) 
TN S&S at bat ® eS <{ i oe @ 
— pred (ee) W Co O > a> -m © LL LL 
Y CQ Lu al 2a oD a ed Eyez 
Ye ~ ~ jam «xf e — Y) ® Co we <I @ Oe 
LL met x © Ce <u ~ © ae = - UES oN LL! cO b= <f UO mi — = 
i © Yo ~- W e eco ol a wy @ cw Gy 2 ect ee one Yom = lA. Mas CO 
—© Il Ni ae eee Hi ee ee * G&G i — eOct amketYe _ ey) U Waza ~ 
~Oo— =a“) mw — a —™? WW UW _— = =a m=O “YNZ w— LL} me Lil Om met C7 
ae Oey caIiNe AYN DO Wl Y Ih e AY © Ut et «@ Quwedt & z@WO ~ YA ~-x~D aN ~ 
oe ee) — em Ie eOLTt —  —-_—lhUr Wan e#O Co ae |} wt LL ollie OC MNwu— eZ ewe AYO 
mS et PY A et OE ed ee Ses |] be ~— mt Oem we YM) II a Ail-= O—||}ao " mNxXet WO Ox Dit Neti wil 
= Om) ey Il Eee ee Was Ww GC OkeaeeKe Wms WC Trees <I KX it a fp eee INL OY II VE LL fame mm ee om 0} 
wee fe er ied ee LU Yee YU OO Re Owe Tew YK OW KE Oe DPR Ree ee Kem OW ZOSOQOR eae een OWA ZYD 
MOOK FN ee KO ORL KNOY Z+tOM AOw~OeCXKE Ee OOo 2z +N. 2 GMXZOCE OS + KON ee ZFOrZzeL 
Le LU LUNI Oe LOO KOO Oe A OOO OI erie OC tL HO OO 
O™7MOM NO se Nw OF SUT Ce OZ Ze He DOR Ne Lee OZR | OO BR eRe OOM sD Be $NSZUZORKNZNONe Nw ZIZOZE 
Zea z= COUNGZ~LIIO WA ZEO li —|YH~—IOWaA Zs il Olt NZ b— ST DBD SI SS I mur eS HON ISHS 
Pmt LL mt (tt tt CL et Le tO et OO OLE OL Le KE ORO YOOYLOWOO™ML HOWONON HLOCTOCOT LL SOW e TOK OO 
RENN eee ee ee ee Fe a dea 
\ { i 
© © mW oO ei { { © eal uy (or wonm WwW Oo MY {© uy 
= AL-.- NM a8) t | Wn Ly Ly i- = - - OD OD | © = 
= _ ed eed — t | eH ~_ — a ee ee | iN N 
tf { i 
OW WO WO 


ay 





LU <{ 

i} ¥r _— 
Tt NE <{ 
fame fm OS) ramet a 

<I WO LL 
WOO eS LU 
YW @eNE © Y 
CNAIMNYO <I 
LL LL med EX LL LL = 
>rO an 
<—) ee ‘an 
AOrWwetivW a. 
E> SU <{ ~ 
eemeY © C 
OO emai iw LJ ow 
er WO ee D © e 
QOwWwWoaoO < O WY 
Ole ae) ce LL -— 
wiCNOde © a WY 
ee Wo © b= pam 

ib LWW OC WY Lu <— 
IO awaew aie LLJ om 
Ke ¢ Oa ot = 
IM me tL) <{ >-— 
Cy Le Ly, = Zz Cm fj 
ODRK ee fm —_ EST = 
IOr < eo ee Fy, EL) 
ee Lene Lu f— so LL = 
MY OOK = =z Zz - 
ZOkr <ke CF N Cie LLC? Ba 
rear ON) LL N ae ejlU J 
IiwOwW < WY e < WwW sam « 
- — <{ J VY) = a -eeO WO 
SS aie Zz Ww Ww Nee — 
Osw<entw rt SS — Cd mw 
Omtawa Tr 2S ew qt UuUDMe ais 

Act Ohne 2UVeag — elm OF 
Cis oe) CZ AUMUW = aaa Ww = 
Ore=OCN Mw IW UW © me Ow Dat Oe Ww 
 <f Oa O mm Oe Oo 2 WO2z 
IW WE O Wi eOUOW eS AT tlt een) << 
SIOYCMAW oy oJ Fee eYD <Q ww — oJ 
aI aIi a oe <f + O BrMBewszo FAI FS TFT YODA 
= = s> =Wt DDDAINr HWMW CWO BW yw 
MOMIULWaY < ile 222—- I] HN I259 jw erie 


OS we + Dew 
PNW MMODODOMMMWO YL eYODVOZ 
Wee SW YAMS YONA INS Nw J Sw ee EIEN 


Wii WS yt WOOOIR RR CONIOOrFYIEONw~ NM CACM FEW SE QIN EFr Cn I 


GO TO 275 
O 
R 
L 


TO 44 
ReACUM( MK) .EQ.CP) 

I 

t 


GO 
0 
e 


AR) 
MMA. 
DA 
ST 
SE 


k= <I e e 


ee Lr eO 
om em jal 
SN par Ge 
== OW RLU as 
were fe Ze 
ASN COM 
+>] Bivdo 


ACUM( MK) 
E 
E 


(J) 


wv) 
Oo 
oO wr 
© 
yr © 
= 
©) 
ke OO 
wy coy @ 
0 © Oo 
+ O-n- + 
al 
OQ —~ = © 
uy b— ¥Y~ at & 
wv) Fi 
LY © qt OM © 
Oo ad e © 
O ~ o Oo 
od ay = e LJ a 
© Zz LU e¢ © 
© Yom 0 a rs mom 0 
M<f elu Im om e ell 
-_ ~—j") @ —-—4 m= QJ) © 
cro AONe- SoZ 2 NN 
Oe ilwrse CDOwr www SS a 
COTM—KOVVON 'IZZD DW KKOOMON Il NON 


De YOOOCCDS WH BZWMNTeWWOw ~~ ORnOOKReWWO~ ~O~ 
Ot Sate W OM) It prOtsZZerseewSaSZZZeazaZZereseee YH 


KUIxKOW I~ ~iHwe ll «KIUK KY 


Deduct S SRC XY LOWeWYdi  e YULOW HW MLO ST i) SUWOW SL EOWMIWZSQHLOUSULSOWIWOW 
KR WENO ZZXYNACVNQDOOHONNNANOQOW ES SH QONMMNI ES SHO OWE SZORM”NOVOHOO ZOOS YCONZHORMOMsOOZOZOZO 


Pott | i 4 | 


| | 
fbrreypltoe ' 4 Oo | ith WwW WE 
bebe d punw | oO | ;' WW Ov 
Prbrrepam ' | NI rie oN NN 
Prreddbe it { ian 
SlSISlISrOre) WO O Ow 


“uy © ea) Oo UV © ~ 
LV OO 0 oOo a oO oC 
Se ee tt ot + 


80 





Y 
Y 
<I 
oO =a) 
Ov wo) 
wv * 
ae 
O oO 
- bm 
la 
© — 
Cc z 
= 
— Le 
ee LU 
2) CQ 
e ~ 
cr b— - 
LLU ce) IW oe 
e CO = WwW a 
-_ iw CL Ww 
co a CG) OC a 
Oo = © LL tL -— oad 
+ Sd + — =) Om 
iw T=zO Tu NS OuL = 
O wv SS) © SS LJ J © 
me N OOws OW Oo ™ - WW aDO 
he <I b= © i N= Wwe 
ee “© OQ 3) II = LU <f m _) ° 
Oo - Oe W OF -~ -_ MS C= e—O Y 
OC) Ly Dj — Ne) Zz FE © l= Cu LJ 
— OO ee N @ Ww © “ <I Lh as n= 
<r © om <I C9) am DO Nir iw) ee, i, es O. <i Ow FE Se 
=. Y= ce we LO) wy —Y om © =. IrreO OZ 
=_ = Is— <I-— Tu — -_ ® LU oY Sone 
=o. Oo. = 2. - -_ © ~< ht Co = ee fer OC LL Ore 
> -@ HOG Te MAO MAO —wd - LW as Ww © Ow = w>NY F< 
° e eee © e 8 Chk eed Ox ee e ® fs — ONG = 
ox uy CHIVES CoC A YA we © oo (CS) we wo mom CY QO w Ont Ssz OOm 
HN Fase 1 8) Ww LW Lu lo I ee <> m— Il Ht <I hr Ww we Cee awe O@ @ 
om e e e® 6 @ = e @ wt om (9 am | >= —_ — o~ wee il Co) —™ om & ~ LL # 
-=O- = —~os ° _~— + = 0 x = << wi OFOm Jjme = —=O KK eOLDHO est = et 
SS NM ¥Y¥Y¥Y W YY es — ye Ww +4 jm D072 ww = oe MOS MSM IO es TF HS 
ee ee) eae LL) Sa NHI FD ON ee NW NAMONM ZZ KMKMOOWON | ZLNON eZ NOO Il lil COULO © wd OC 
NY Il ~wO wwe 0 wes OS +N 2 Y il ao +t bh CNW NG NO WW we OD mW ON HW OO em eM SW 


we Pm SR ENGST TNE SASS NUN Re We em UN ee Ue lLU ff} RMT ROO RRO RWS Hm rem es mR Ot Ow eZ OO 
Xe O+D +€DDD BO +¢DD OCWKN$+ KR wt OH He KOK RMR FEZ KK COO RKRK COm—MamRKe OHS Ne DN~eEA ep— 
WY ew OO YUOOYUUUOD IlLEAYOVUOOTWNW enw NODS OMWY Uwe We Dm ee WUWO~W! ewe iw Ow WYwO 2ZOT 2uUte-O7mD eS 
OwS aoe deSadtet oe Se SS IIE OMMO Aw I IONE ON MOO Ow SO ZS EZ Fe I we SF Fr NOW ER NR OO UKE SS ORAS 
ZOU Iw ree il [| ww Rae ISUNOHwi WOwrszeMoUZOWZF IIintihn lt «KUKOW KYZOW eth Cee wr 
S$ TEU YLOYUILULOU Dae YL OO ee I et Oe le SO LE OW eS OW ee STO TWwC uO OUD 
SO et Se OS tt Ot SOS SOON SNS "NN SRN NS SFOS MON Z HOS OOZOZMCOUO ZOOS YOOFrOONOHKH SW OFOoO 


LY NM oO eS uy in © Ww Sm uy Oo © Ou ~O uy 
nO DO oO oO Oo nm + + LY oN 0 Ph Oo oe Or = 
+ Ln) Lr) Wm Ww un) uy Wh un) wn uy 0 0 00 Se) 


C----- 
C----—- 


81 





[Program listing, page 82: not legible in the scanned source. Recoverable FORTRAN fragments: "REQUEST COMPLETE*"; "...EQ.STAR) GO TO 625"; "GO TO 660"; "...SE+1).EQ.STAR)"; "...N),(SEQ(IX),IX=...".]

[Program listing, page 83: not legible in the scanned source. Recoverable FORTRAN fragments: "...) GO TO 771"; "...) GO TO 765"; "...SE+1).EQ.STAR) GO TO 780"; "...Q.OP) GO TO 810"; comment text "...PROCESSES HYPHEN AND..." and "...QUESTS.".]

[Program listing, page 84: not legible in the scanned source.]

[Program listing, page 85: not legible in the scanned source.]

[Program listing, page 86: not legible in the scanned source.]

[Program listing, page 87: not legible in the scanned source.]

[Program listing, page 88: not legible in the scanned source.]

[Program listing, page 89: not legible in the scanned source. Recoverable FORTRAN fragments: "GO TO 50"; "...0) GO TO 40"; "...0.AND.F3.EQ.0.AND.F4.EQ.0) GO TO 30"; "....EQ.OP.OR.ACUM(I).EQ.CP.OR.ACUM(I).EQ.COMMA)"; "...).NE.HYP)"; "...),SEQ(4000),ACUM(241)"; "...DOWN(601)"; "...HT(2),DOWN(1))".]

[Program listing, page 90: not legible in the scanned source. Recoverable FORTRAN fragments: "...GHT(DD)"; "GO TO 12"; "...R).EQ.0)"; "GO TO 10"; "...K).EQ.STAR) GO TO 30"; "...K).EQ.OP)"; "MULT=1"; "...(MK).EQ.OP)".]

[Program listing, page 91: not legible in the scanned source.]

BIBLIOGRAPHY


1. Taube, M., "Progress in the Design of Information Retrieval
Systems," Advances in EDP and Information Systems, American
Management Association Report No. 62, 1961.


2. Bourne, C. P., Methods of Information Handling, Wiley, 1963.


3. Pecan, G. [author name uncertain; garbled in the scanned source],
Experiments in Large Scale Computer Direct Access Storage
Manipulation, Computer Science Group, University of Washington,
Seattle, Technical Report No. 69-1-01, January 8, 1969.


4. Meadow, C. T., The Analysis of Information Systems: An
Introduction to Information Retrieval, Wiley, 1967.


5. Hays, D. G., "Research Procedures in Machine Translation,"
Natural Language and the Computer, p. 183-214, McGraw-Hill, 1963.


6. Harper, K. E., "Dictionary Problems in Machine Translation,"
Natural Language and the Computer, p. 215-222, McGraw-Hill, 1963.


7. Garvin, P. L., "Syntax in Machine Translation," Natural
Language and the Computer, p. 223-232, McGraw-Hill, 1963.


8. Mersel, J., "Programming Aspects of Machine Translation,"
Natural Language and the Computer, p. 233-251, McGraw-Hill, 1963.


9. Salton, G., "A Comparison Between Manual and Automatic
Indexing Methods," American Documentation, v. 20, no. 1, p. 61-71,
January, 1969.


10. [Author name illegible in the scanned source], "Analytic
Information Retrieval," Natural Language and the Computer,
p. [illegible], McGraw-Hill, 1963.


11. Green, B. F., and others, "Baseball: An Automatic Question
Answerer," Computers and Thought, p. 207-216, McGraw-Hill, 1963.


12. Weissman, C., LISP 1.5 Primer, p. 5-22, Dickenson, 1968.


13. Computer Facility, Naval Postgraduate School, CP/CMS User's
Guide, September, 1969.


14. Becker, J. and Hayes, R. M., Information Storage and Retrieval:
Tools, Elements, Theories, Wiley, 1963.


15. Williams, W. F., Principles of Automated Information Retrieval, 
The Business Press, 1965. 


16. Artandi, S., An Introduction to Computers in Information Science,
The Scarecrow Press, 1968.


17. Jonker, F., Indexing Theory, Indexing Methods, and Search
Devices, The Scarecrow Press, 1964.


18. Swets, J. A., "Effectiveness of Information Retrieval Methods,"
American Documentation, v. 20, no. 1, p. 72-89, January, 1969.


19. Kellogg, C., "The Fact Compiler: A System for the Extraction,
Storage, and Retrieval of Information," Proceedings of the Western
Joint Computer Conference, vol. 17, p. 73-82, 1960.


20. Martin, J., Design of Real-Time Computer Systems, Prentice-
Hall, 1967.


21. Withington, F. G., The Use of Computers in Business
Organizations, Addison-Wesley, 1966.


INITIAL DISTRIBUTION LIST

                                                      No. Copies

1.  Defense Documentation Center                           2
    Cameron Station
    Alexandria, Virginia 22314

2.  Library, Code 0212                                     2
    Naval Postgraduate School
    Monterey, California 93940

3.  Gary A. Kildall, USNR                                  1
    Department of Mathematics (Code 53)
    Naval Postgraduate School
    Monterey, California 93940

4.  LCDR Richard J. Petrucci, USN                          1
    Patrol Squadron 16
    Naval Air Station
    Jacksonville, Florida 32212


UNCLASSIFIED
Security Classification

DOCUMENT CONTROL DATA - R & D
(Security classification of title, body of abstract and indexing annotation must be entered when the overall report is classified)

1. ORIGINATING ACTIVITY (Corporate author)
   Naval Postgraduate School
   Monterey, California 93940

2a. REPORT SECURITY CLASSIFICATION
   Unclassified

2b. GROUP

3. REPORT TITLE
   Application of a Data Structuring Concept in a General-Purpose Fact-Retrieval System

4. DESCRIPTIVE NOTES (Type of report and inclusive dates)
   Master's Thesis; September 1970

5. AUTHOR(S) (First name, middle initial, last name)
   Richard Joseph Petrucci

6. REPORT DATE
   September 1970

7a. TOTAL NO. OF PAGES
   94

7b. NO. OF REFS
   21

8a. CONTRACT OR GRANT NO.

b. PROJECT NO.

c.

d.

9a. ORIGINATOR'S REPORT NUMBER(S)

9b. OTHER REPORT NO(S) (Any other numbers that may be assigned this report)

10. DISTRIBUTION STATEMENT
   This document has been approved for public release and sale; its distribution is unlimited.

11. SUPPLEMENTARY NOTES

12. SPONSORING MILITARY ACTIVITY
   Naval Postgraduate School
   Monterey, California 93940

13. ABSTRACT


An on-line, general-purpose fact-retrieval system is presented which
employs a classificatory data structuring technique. The technique embraces
the basic concept of hierarchical classification of data and provides users
with multiple avenues of access to a data file. Additionally, the data file may
be partitioned into unrelated data sets.


DD FORM 1473                                   UNCLASSIFIED
S/N 0101-807-6811                     Security Classification    A-31408





UNCLASSIFIED
Security Classification

KEY WORDS

   Information retrieval
   Hierarchical classification
   Indexing
   Fact retrieval
   Computer
   Storage

DD FORM 1473 (BACK)                            UNCLASSIFIED
S/N 0101-807-6821                     Security Classification    A-31409







